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SEQUENCE LISTING 



<110> BASF AG 

<120> Method for producing carotenoids or their precursors 



using genetically modified organisms of the Blakeslea genus, 
carotenoids or their precursors produced by said method and 
use thereof 



<130> BASF/NAE87 7/03 
<160> 80 

<170> Patentln version 3.2 

<210> 1 

<211> 2160 

<212> DNA 

<213> Artificial 

<220> 

<223> Promoter 
<400> 1 

ctttcgacac tgaaatacgt cgagcctgct ccgcttggaa gcggcgagga gcctcgtcct 
gtcacaacta ccaacatgga gtacgataag ggccagttcc gccagctcat taagagccag 
ttcatgggcg ttggcatgat ggccgtcatg catctgtact tcaagtacac caacgctctt 
ctgatccagt cgatcatccg ctgaaggcgc tttcgaatct ggttaagatc cacgtcttcg 
ggaagccagc gactggtgac ctccagcgtc cctttaaggc tgccaacagc tttctcagcc 
agggccagcc caagaccgac aaggcctccc tccagaacgc cgagaagaac tggaggggtg 
gtgtcaagga ggagtaagct ccttattgaa gtcggaggac ggagcggtgt caagaggata 
ttcttcgact ctgtattata gataagatga tgaggaattg gaggtagcat agcttcattt 



60 
120 
180 
240 
300 
360 
420 
480 



ggatttgctt tccaggctga gactctagct tggagcatag agggtccttt ggctttcaat 



540 
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attctcaagt atctcgagtt tgaacttatt ccctgtgaac cttttattca ccaatgagca 600 

ttggaatgaa catgaatctg aggactgcaa tcgccatgag gttttcgaaa tacatccgga 660 

tgtcgaaggc ttggggcacc tgcgttggtt gaatttagaa cgtggcacta ttgatcatcc 720 

gatagctctg caaagggcgt tgcacaatgc aagtcaaacg ttgctagcag ttccaggtgg 780 

aatgttatga tgagcattgt attaaatcag gagatatagc atgatctcta gttagctcac 840 

cacaaaagtc agacggcgta accaaaagtc acacaacaca agctgtaagg atttcggcac 900 

ggctacggaa gacggagaag ccaccttcag tggactcgag taccatttaa ttctatttgt 960 

gtttgatcga gacctaatac agcccctaca acgaccatca aagtcgtata gctaccagtg 1020 

aggaagtgga ctcaaatcga cttcagcaac atctcctgga taaactttaa gcctaaacta 1080 

tacagaataa gataggtgga gagcttatac cgagctccca aatctgtcca gatcatggtt 1140 

gaccggtgcc tggatcttcc tatagaatca tccttattcg ttgacctagc tgattctgga 1200 

gtgacccaga gggtcatgac ttgagcctaa aatccgccgc ctccaccatt tgtagaaaaa 1260 

tgtgacgaac tcgtgagctc tgtacagtga ccggtgactc tttctggcat gcggagagac 1320 

ggacggacgc agagagaagg gctgagtaat aagccactgg ccagacagct ctggcggctc 1380 

tgaggtgcag tggatgatta ttaatccggg accggccgcc cctccgcccc gaagtggaaa 14 40 

ggctggtgtg cccctcgttg accaagaatc tattgcatca tcggagaata tggagcttca 1500 

tcgaatcacc ggcagtaagc gaaggagaat gtgaagccag gggtgtatag ccgtcggcga 1560 

aatagcatgc cattaaccta ggtacagaag tccaattgct tccgatctgg taaaagattc 1620 

acgagatagt accttctccg aagtaggtag agcgagtacc cggcgcgtaa gctccctaat 1680 

tggcccatcc ggcatctgta gggcgtccaa atatcgtgcc tctcctgctt tgcccggtgt 1740 
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atgaaaccgg aaaggccgct caggagctgg ccagcggcgc .agaccgggaa cacaagctgg 1800 

cagtcgaccc atccggtgct ctgcactcga cctgctgagg tccctcagtc cctggtaggc 1860 

agctttgccc cgtctgtccg cccggtgtgt cggcggggtt gacaaggtcg ttgcgtcagt 1920 

ccaacatttg ttgccatatt ttcctgctct ccccaccagc tgctcttttc ttttctcttt 1980 

cttttcccat cttcagtata ttcatcttcc catccaagaa cctttatttc ccctaagtaa 2040 

gtactttgct acatccatac tccatccttc ccatccctta ttcctttgaa cctttcagtt 2100 

cgagctttcc cacttcatcg cagcttgact aacagctacc ccgcttgagc agacatcacc 2160 

<210> 2 

<211> 774 

<212> DNA 

<213> Artificial 

<220> 

<223> Terminator 



<220> 

<221> misc_f eature 

<222> (267) . . (267) 

<223> n is a, c, g, or t 

<220> 

<221> mi sc_f eature 

<222> (475) . . (475) 

<223> n is a, c, g, or t 

<220> 

<221> mi sc_f eature 

<222> (566) . . (566) 

<223> n is a, c, g, or t 



<400> 2 

cgatccactt aacgttactg aaatcatcaa acagcttgac gaatctggat ataagatcgt 



60 
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tggtgtcgat gtcagctccg gagttgagac aaatggtgtt caggatctcg ataagatacg 120 

ttcatttgtc caagcagcaa agagtgcctt ctagtgattt aatagctcca tgtcaacaag 180 

aataaaacgc gttttcgggt ttacctcttc cagatacagc tcatctgcaa tgcattaatg 240 

cattgactgc aacctagtaa cgccttncag gctccggcga agagaagaat agcttagcag 300 

agctattttc attttcggga gacgagatca agcagatcaa cggtcgtcaa gagacctacg 360 

agactgagga atccgctctt ggctccacgc gactatatat ttgtctctaa ttgtactttg 420 

acatgctcct cttctttact ctgatagctt gactatgaaa attccgtcac cagcncctgg 480 

gttcgcaaag ataattgcat gtttcttcct tgaactctca agcctacagg acacacattc 540 

atcgtaggta taaacctcga aatcanttcc tactaagatg gtatacaata gtaaccatgc 600 

atggttgcct agtgaatgct ccgtaacacc caatacgccg gccgaaactt ttttacaact 660 

ctcctatgag tcgtttaccc agaatgcaca ggtacacttg tttagaggta atccttcttt 720 

ctagctagaa gtcctcgtgt actgtgtaag cgcccactcc acatctccac tcga 774 

<210> 3 
<211> 15739 
<212> DNA 
<213> Artificial 

<220> 

<223> Vector 



<220> 

<221> misc_feature 

<222> (3471) . . (3471) 

<223> n is a, c, g, or t 

<220> 
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<221> misc__f eature 
<222> (3679) . . (3679) 
<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (3770) . . (3770) 

<223> n is- a, c, g, or t 

<400> 3 

gatctttcga cactgaaata cgtcgagcct gctccgcttg gaagcggcga ggagcctcgt 60 

cctgtcacaa ctaccaacat ggagtacgat aagggccagt tccgccagct cattaagagc 120 

cagttcatgg gcgttggcat gatggccgtc atgcatctgt acttcaagta caccaacgct 180 

cttctgatcc agtcgatcat ccgctgaagg cgctttcgaa tctggttaag atccacgtct 240 

tcgggaagcc agcgactggt gacctccagc gtccctttaa ggctgccaac agctttctca 300 

gccagggcca gcccaagacc gacaaggcct ccctccagaa cgccgagaag aactggaggg 360 

gtggtgtcaa ggaggagtaa gctccttatt gaagtcggag gacggagcgg tgtcaagagg 420 



atattcttcg actctgtatt atagataaga tgatgaggaa ttggaggtag catagcttca 



180 



tttggatttg ctttccaggc tgagactcta gcttggagca tagagggtcc tttggctttc 540 

aatattctca agtatctcga gtttgaactt attccctgtg aaccttttat tcaccaatga 600 

gcattggaat gaacatgaat ctgaggactg caatcgccat gaggttttcg aaatacatcc 660 

ggatgtcgaa ggcttggggc acctgcgttg gttgaattta gaacgtggca ctattgatca 720 

tccgatagct ctgcaaaggg cgttgcacaa tgcaagtcaa acgttgctag cagttccagg 780 

tggaatgtta tgatgagcat tgtattaaat caggagatat agcatgatct ctagttagct 840 

caccacaaaa gtcagacggc gtaaccaaaa gtcacacaac acaagctgta aggatttcgg 900 



cacggctacg gaagacggag aagccacctt cagtggactc gagtaccatt taattctatt 960 
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tgtgtttgat cgagacctaa tacagcccct acaacgacca tcaaagtcgt atagctacca 1020 

gtgaggaagt ggactcaaat cgacttcagc aacatctcct ggataaactt taagcctaaa 1080 

ctatacagaa taagataggt ggagagctta taccgagctc ccaaatctgt ccagatcatg 1140 

gttgaccggt gcctggatct tcctatagaa tcatccttat tcgttgacct agctgattct 1200 

ggagtgaccc agagggtcat gacttgagcc taaaatccgc cgcctccacc atttgtagaa 1260 

aaatgtgacg aactcgtgag ctctgtacag tgaccggtga ctctttctgg catgcggaga 1320 

gacggacgga cgcagagaga agggctgagt aataagccac tggccagaca gctctggcgg 1380 

ctctgaggtg cagtggatga ttattaatcc gggaccggcc gcccctccgc cccgaagtgg 1440 

aaaggctggt gtgcccctcg ttgaccaaga atctattgca tcatcggaga atatggagct 1500 

t.catcgaatc accggcagta agcgaaggag aatgtgaagc caggggtgta tagccgtcgg 1560 

cgaaatagca tgccattaac ctaggtacag aagtccaatt gcttccgatc tggtaaaaga 1620 

ttcacgagat agtaccttct ccgaagtagg tagagcgagt acccggcgcg taagctccct 1680 

aattggccca tccggcatct gtagggcgtc caaatatcgt gcctctcctg ctttgcccgg 1740 

tgtatgaaac cggaaaggcc gctcaggagc tggccagcgg cgcagaccgg gaacacaagc 1800 

tggcagtcga cccatccggt gctctgcact cgacctgctg aggtccctca gtccctggta 1860 

ggcagctttg ccccgtctgt ccgcccggtg tgtcggcggg gttgacaagg tcgttgcgtc 1920 

agtccaacat ttgttgccat attttcctgc tctccccacc agctgctctt ttcttttctc 1980 

tttcttttcc catcttcagt atattcatct tcccatccaa gaacctttat ttcccctaag 2040 

taagtacttt gctacatcca tactccatcc ttcccatccc ttattccttt gaacctttca 2100 

gttcgagctt tcccacttca tcgcagcttg actaacagct accccgcttg agcagacatc 2160 
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accatgcctg aactcaccgc 
gtctccgacc tgatgcagct 
ggagggcgtg gatatgtcct 
tatgtttatc ggcactttgc 
gaattcagcg agagcctgac 
gacctgcctg aaaccgaact 
atcgctgcgg ccgatcttag 
ggtcaataca ctacatggcg 
tggcaaactg tgatggacga 
atgctttggg ccgaggactg 
aacaatgtcc tgacggacaa 
ttcggggatt cccaatacga 
atggagcagc agacgcgctra 
ctccgggcgt atatgctccg 
aatttcgatg atgcagcttg 
gggactgtcg ggcgtacaca 
gtagaagtac tcgccgatag 
tagagtagat gccgaccgcg 
tgacgaatct ggatataaga 
tgttcaggat ctcgataaga 
atttaatagc tccatgtcaa 
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gacgtctgtc gagaagtttc tgatcgaaaa gttcgacagc 2220 

ctcggagggc gaagaatctc gtgctttcag cttcgatgta 2280 

gcgggtaaat agctgcgccg atggtttcta caaagatcgt 2340 

atcggccgcg ctcccgattc cggaagtgct tgacattggg 2400 

ctattgcatc tcccgccgtg cacagggtgt cacgttgcaa 24 60 

gcccgctgtt ctgcagccgg tcgcggaggc catggatgcg 2520 

ccagacgagc gggttcggcc cattcggacc gcaaggaatc 2580 

tgatttcata tgcgcgattg ctgatcccca tgtgtatcac 2640 

caccgtcagt gcgtccgtcg cgcaggctct cgatgagctg 2700 

ccccgaagtc cggcacctcg tgcacgcgga tttcggctcc 2760 

tggccgcata acagcggtca ttgactggag cgaggcgatg 2820 

ggtcgccaac atcttcttct ggaggccgtg gttggcttgt 2880 

cttcgagcgg aggcatccgg agcttgcagg atcgccgcgg 2940 

cattggtctt gaccaactct atcagagctt ggttgacggc 3000 

ggcgcagggt cgatgcgacg caatcgtccg atccggagcc 3060 

aatcgcccgc agaagcgcgg ccgtctggac cgatggctgt 3120 

tggaaaccga cgccccagca ctcgtccgag ggcaaaggaa 3180 

ggatcgatcc acttaacgtt actgaaatca tcaaacagct 3240 

tcgttggtgt cgatgt.cagc tccggagttg agacaaatgg 3300 

tacgttcatt tgtccaagca gcaaagagtg ccttctagtg 3360 

caagaataaa acgcgttttc gggtttacct cttccagata 3420 
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cagctcatct gcaatgcatt aatgcattga ctgcaaccta gtaacgcctt ncaggctccg 34 80 

gcgaagagaa gaatagctta gcagagctat tttcattttc gggagacgag atcaagcaga 3540 

tcaacggtcg tcaagagacc tacgagactg aggaatccgc tcttggctcc acgcgactat 3600 

atatttgtct ctaattgtac tttgacatgc tcctcttctt tactctgata gcttgactat 3660 

gaaaattccg tcaccagcnc ctgggttcgc aaagataatt gcatgtttct tccttgaact 3720 

ctcaagccta caggacacac attcatcgta ggtataaacc tcgaaatcan ttcctactaa 3780 

gatggtatac aatagtaacc atgcatggtt gcctagtgaa tgctccgtaa cacccaatac 3840 

gccggccgaa acttttttac aactctccta tgagtcgttt acccagaatg cacaggtaca 3900 

cttgtttaga ggtaatcctt ctttctagct agaagtcctc gtgtactgtg taagcgccca 3960 

ctccacatct ccactcgacc tgcaggcatg caagcttggc gtaatcatgg tcatagctgt 4020 

ttcctgtgtg aaattgttat ccgctcacaa ttccacacaa catacgagcc ggaagcataa 4080 

agtgtaaagc ctggggtgcc taatgagtga gctaactcac attaattgcg ttgcgctcac 4140 

tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg 4200 

cggggagagg cggtttgcgt attgggccaa agacaaaagg gcgacattca accgattgag 4260 

ggagggaagg taaatattga cggaaattat tcattaaagg tgaattatca ccgtcaccga 4320 

cttgagccat ttgggaatta gagccagcaa aatcaccagt agcaccatta ccattagcaa 4380 

ggccggaaac gtcaccaatg aaaccatcga tagcagcacc gtaatcagta gcgacagaat 4440 

caagtttgcc tttagcgtca gactgtagcg cgttttcatc ggcattttcg gtcatagccc 4500 

ccttattagc gtttgccatc ttttcataat caaaatcacc ggaaccagag ccaccaccgg 4560 

aaccgcctcc ctcagagccg ccaccctcag aaccgccacc ctcagagcca ccaccctcag 4 620 
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agccgccacc agaaccacca ccagagccgc cgccagcatt gacaggaggc ccgatctagt 4 680 

aacatagatg acaccgcgcg cgataattta tcctagtttg cgcgctatat tttgttttct 4740 

atcgcgtatt aaatgtataa ttgcgggact ctaatcataa aaacccatct cataaataac 4800 

gtcatgcatt acatgttaat tattacatgc ttaacgtaat tcaacagaaa ttatatgata 4860 

atcatcgcaa gaccggcaac aggattcaat cttaagaaac tttattgcca aatgtttgaa 4920 

cgatcgggga tcatccgggt ctgtggcggg aactccacga aaatatccga acgcagcaag 4 980 

atatcgcggt gcatctcggt cttgcctggg cagtcgccgc cgacgccgtt gatgtggacg 5040 

ccgggcccga tcatattgtc gctcaggatc gtggcgttgt gcttgtcggc cgttgctgtc 5100 

gtaatgatat cggcaccttc gaccgcctgt tccgcagaga tcccgtgggc gaagaactcc 5160 

agcatgagat ccccgcgctg gaggatcatc cagccggcgt cccggaaaac gattccgaag 5220 

cccaaccttt catagaaggc ggcggtggaa tcgaaatctc gtgatggcag gttgggcgtc 5280 

gcttggtcgg tcatttcgaa ccccagagtc ccgctcagaa gaactcgtca agaaggcgat 5340 

agaaggcgat gcgctgcgara tcgggagcgg cgataccgta aagcacgagg aagcggtcag 5400 

cccattcgcc gccaagctct tcagcaatat cacgggtagc caacgctatg tcctgatagc 5460 

ggtccgccac acccagccgg ccacagtcga tgaatccaga aaagcggcca ttttccacca 5520 

tgatattcgg caagcaggca tcgccatggg tcacgacgag atcatcgccg tcgggcatgc 5580 

gcgccttgag cctggcgaac agttcggctg gcgcgagccc ctgatgctct tcgtccagat 5640 

catcctgatc gacaagaccg gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg 5700 

cttggtggtc gaatgggcag gtagccggat caagcgtatg cagccgccgc attgcatcag 5760 

ccatgatgga tactttctcg gcaggagcaa ggtgagatga caggagatcc tgccccggca 5820 

cttcgcccaa tagcagccag tcccttcccg cttcagtgac aacgtcgagc acagctgcgc 5880 
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aaggaacgcc cgtcgtggcc agccacgata gccgcgctgc ctcgtcctgc agttcattca 5940 

gggcaccgga caggtcggtc ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga 6000 

acacggcggc atcagagcag ccgattgtct gttgtgccca gtcatagccg aatagcctct 6060 

ccacccaagc ggccggagaa cctgcgtgca atccatcttg ttcaatcatg cgaaacgatc 6120 

cagatccggt gcagattatt tggattgaga gtgaatatga gactctaatt ggataccgag 6180 

gggaatttat ggaacgtcag tggagcattt ttgacaagaa atatttgcta gctgatagtg 6240 

accttaggcg acttttgaac gcgcaataat ggtttctgac gtatgtgctt agctcattaa 6300 

actccagaaa cccgcggctg agtggctcct tcaacgttgc ggttctgtca gttccaaacg 6360 

taaaacggct tgtcccgcgt catcggcggg ggtcataacg tgactccctt aattctccgc 6420 

tcatgatcag attgtcgttt cccgccttca gtttaaacta tcagtgtttg acaggatata 6480 

ttggcgggta aacctaagag aaaagagcgt ttattagaat aatcggatat ttaaaagggc 6540 

gtgaaaaggt ttatccgttc gtccatttgt atgtgcatgc caaccacagg gttccccaga 6600 

tctggcgccg gccagcgaga cgagcaagat tggccgccgc ccgaaacgat ccgacagcgc 6660 

gcccagcaca ggtgcgcagg caaattgcac caacgcatac agcgccagca gaatgccata 6720 

gtgggcggtg acgtcgttcg agtgaaccag atcgcgcagg aggcccggca gcaccggcat 6780 

aatcaggccg atgccgacag cgtcgagcgc gacagtgctc agaattacga tcaggggtat 6840 

gttgggtttc acgtctggcc tccggaccag cctccgctgg tccgattgaa cgcgcggatt 6900 

ctttatcact gataagttgg tggacatatt atgtttatca gtgataaagt gtcaagcatg 6960 

acaaagttgc agccgaatac agtgatccgt gccgccctgg acctgttgaa cgaggtcggc 7020 

gtagacggtc tgacgacacg caaactggcg gaacggttgg gggttcagca gccggcgctt 7080 
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tactggcact tcaggaacaa gcgggcgctg ctcgacgcac tggccgaagc catgctggcg 7140 

gagaatcata cgcattcggt gccgagagcc gacgacgact ggcgctcatt tctgatcggg 7200 

aatgcccgca gcttcaggca ggcgctgctc gcctaccgcg atggcgcgcg catccatgcc 7260 

ggcacgcgac cgggcgcacc gcagatggaa acggccgacg cgcagcttcg cttcctctgc 7320 

gaggcgggtt tttcggccgg ggacgccgtc aatgcgctga tgacaatcag ctacttcact 7380 

gttggggccg tgcttgagga gcaggccggc gacagcgatg ccggcgagcg cggcggcacc 7440 

gttgaacagg ctccgctctc gccgctgttg cgggccgcga tagacgcctt cgacgaagcc 7500 

ggtccggacg cagcgttcga gcagggactc gcggtgattg tcgatggatt ggcgaaaagg 7560 

aggctcgttg tcaggaacgt tgaaggaccg agaaagggtg acgattgatc aggaccgctg 7620 

ccggagcgca acccactcac tacagcagag ccatgtagac aacatcccct ccccctttcc 7680 

accgcgtcag acgcccgtag cagcccgcta cgggcttttt catgccctgc cctagcgtcc 7740 

aagcctcacg gccgcgctcg gcctctctgg cggccttctg gcgctcttcc gcttcctcgc 7800 

tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg 7860 

cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 7920 

gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc 7980 

gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 8040 

gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga 8100 

ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgcttttcc 8160 

gctgcataac cctgcttcgg ggtcattata gcgatttttt cggtatatcc atcctttttc 8220 

gcacgatata caggattttg ccaaagggtt cgtgtagact ttccttggtg tatccaacgg 8280 

cgtcagccgg gcaggatagg tgaagtaggc ccacccgcga gcgggtgttc cttcttcact 8340 
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gtcccttatt cgcacctggc ggtgctcaac gggaatcctg ctctgcgagg ctggccggct 8400 

accgccggcg taacagatga gggcaagcgg atggctgatg aaaccaagcc aaccaggaag 84 60 

ggcagcccac ctatcaaggt gtactgcctt ccagacgaac gaagagcgat tgaggaaaag 8520 

gcggcggcgg ccggcatgag cctgtcggcc tacctgctgg ccgtcggcca gggctacaaa 8580 

atcacgggcg tcgtggacta tgagcacgtc cgcgagctgg cccgcatcaa tggcgacctg 8640 

ggccgcctgg gcggcctgct gaaactctgg ctcaccgacg acccgcgcac ggcgcggttc 8700 

ggtgatgcca cgatcctcgc cctgctggcg aagatcgaag agaagcagga cgagcttggc 8760 

aaggtcatga tgggcgtggt ccgcccgagg gcagagccat gactttttta gccgctaaaa 8820 

cggccggggg gtgcgcgtga ttgccaagca cgtccccatg cgctccatca agaagagcga 8880 

cttcgcggag ctggtgaagt acatcaccga cgagcaaggc aagaccgagc gcctttgcga 8940 

cgctcaccgg gctggttgcc ctcgccgctg ggctggcggc cgtctatggc cctgcaaacg 9000 

cgccagaaac gccgtcgaag ccgtgtgcga gacaccgcgg ccgccggcgt tgtggatacc 9060 

tcgcggaaaa cttggccctc actgacagat gaggggcgga cgttgacact tgaggggccg 9120 

actcacccgg cgcggcgttg acagatgagg ggcaggctcg atttcggccg gcgacgtgga 9180 

gctggccagc ctcgcaaatc ggcgaaaacg cctgatttta cgcgagtttc ccacagatga 9240 

tgtggacaag cctggggata agtgccctgc ggtattgaca cttgaggggc gcgactactg 9300 

acagatgagg ggcgcgatcc ttgacacttg aggggcagag tgctgacaga tgaggggcgc 9360 

acctattgac atttgagggg ctgtccacag gcagaaaatc cagcatttgc aagggtttcc 9420 

gcccgttttt cggccaccgc taacctgtct tttaacctgc ttttaaacca atatttataa 9480 

accttgtttt taaccagggc tgcgccctgt gcgcgtgacc gcgcacgccg aaggggggtg 9540 
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cccccccttc tcgaaccctc ccggcccgct aacgcgggcc tcccatcccc ccaggggctg 9600 

cgcccctcgg ccgcgaacgg cctcacccca aaaatggcag cgctggcagt ccttgccatt 9660 

gccgggatcg gggcagtaac gggatgggcg atcagcccga gcgcgacgcc cggaagcatt 9720 

gacgtgccgc aggtgctggc atcgacattc agcgaccagg tgccgggcag tgagggcggc 9780 

ggcctgggtg gcggcctgcc cttcacttcg gccgtcgggg cattcacgga cttcatggcg 9840 

gggccggcaa tttttacctt gggcattctt ggcatagtgg tcgcgggtgc cgtgctcgtg 9900 

ttcgggggtg cgataaaccc agcgaaccat ttgaggtgat aggtaagatt ataccgaggt 9960 

atgaaaacga gaattggacc tttacagaat tactctatga agcgccatat ttaaaaagct 10020 

accaagacga agaggatgaa gaggatgagg aggcagattg ccttgaatat attgacaata 10080 

ctgataagat aatatatctt ttatatagaa gatatcgccg tatgtaagga tttcaggggg 10140 

caaggcatag gcagcgcgct tatcaatata tctatagaat gggcaaagca taaaaacttg 10200 

catggactaa tgcttgaaac ccaggacaat aaccttatag cttgtaaatt ctatcataat 10260 

tgggtaatga ctccaactta ttgatagtgt tttatgttca gataatgccc gatgactttg 10320 

tcatgcagct ccaccgattt tgagaacgac agcgacttcc gtcccagccg tgccaggtgc 10380 

tgcctcagat tcaggttatg ccgctcaatt cgctgcgtat atcgcttgct gattacgtgc 10440 

agctttccct tcaggcggga ttcatacagc ggccagccat ccgtcatcca tatcaccacg 10500 

tcaaagggtg acagcaggct cataagacgc cccagcgtcg ccatagtgcg ttcaccgaat 10560 

acgtgcgcaa caaccgtctt ccggagactg tcatacgcgt aaaacagcca gcgctggcgc 10620 

gatttagccc cgacatagcc ccactgttcg tccatttccg cgcagacgat gacgtcactg 10680 

cccggctgta tgcgcgaggt taccgactgc ggcctgagtt ttttaagtga cgtaaaatcg 10740 

tgttgaggcc aacgcccata atgcgggctg ttgcccggca tccaacgcca ttcatggcca 10800 
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tatcaatgat tttctggtgc gtaccgggtt gagaagcggt gtaagtgaac tgcagttgcc 10860 

atgttttacg gcagtgagag cagagatagc gctgatgtcc ggcggtgctt ttgccgttac 10920 

gcaccacccc gtcagtagct gaacaggagg gacagctgat agacacagaa gccactggag 10980 

cacctcaaaa acaccatcat acactaaatc agtaagttgg cagcatcacc cataattgtg 11040 

gtttcaaaat cggctccgtc gatactatgt tatacgccaa ctttgaaaac aactttgaaa 11100 

aagctgtttt ctggtattta aggttttaga atgcaaggaa cagtgaattg gagttcgtct 11160 

tgttataatt agcttcttgg ggtatcttta aatactgtag aaaagaggaa ggaaataata 11220 

aatggctaaa atgagaatat caccggaatt gaaaaaactg atcgaaaaat accgctgcgt 11280 

aaaagatacg gaaggaatgt ctcctgctaa ggtatataag ctggtgggag aaaatgaaaa 11340 

cctatattta aaaatgacgg acagccggta taaagggacc acctatgatg tggaacggga 11400 

aaaggacatg atgctatggc tggaaggaaa gctgcctgtt ccaaaggtcc tgcactttga 114 60 

acggcatgat ggctggagca atctgctcat gagtgaggcc gatggcgtcc tttgctcgga 11520 

agagtatgaa gatgaacaaa gccctgaaaa gattatcgag ctgtatgcgg agtgcatcag 11580 

gctctttcac tccatcgaca tatcggattg tccctatacg aatagcttag acagccgctt 11640 

agccgaattg gattacttac tgaataacga tctggccgat gtggattgcg aaaactggga 11700 

agaagacact ccatttaaag atccgcgcga gctgtatgat tttttaaaga cggaaaagcc 11760 

cgaagaggaa cttgtctttt cccacggcga cctgggagac agcaacatct ttgtgaaaga 11820 

tggcaaagta agtggcttta ttgatcttgg gagaagcggc agggcggaca agtggtatga 11880 

cattgccttc tgcgtccggt cgatcaggga ggatatcggg gaagaacagt atgtcgagct 11940 

attttttgac ttactgggga tcaagcctga ttgggagaaa ataaaatatt atattttact 12000 
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ggatgaattg ttttagtacc tagatgtggc gcaacgatgc cggcgacaag caggagcgca 12060 

ccgacttctt ccgcatcaag tgttttggct ctcaggccga ggcccacggc aagtatttgg 12120 

gcaaggggtc gctggtattc gtgcagggca agattcggaa taccaagtac gagaaggacg 12180 

gccagacggt ctacgggacc gacttcattg ccgataaggt ggattatctg gacaccaagg 12240 

caccaggcgg gtcaaatcag gaataagggc acattgcccc ggcgtgagtc ggggcaatcc 12300 

cgcaaggagg gtgaatgaat cggacgtttg accggaaggc atacaggcaa gaactgatcg 12360 

acgcggggtt ttccgccgag gatgccgaaa ccatcgcaag ccgcaccgtc atgcgtgcgc 12420 

cccgcgaaac cttccagtcc gtcggctcga tggtccagca agctacggcc aagatcgagc 12480 

gcgacagcgt gcaactggct ccccctgccc tgcccgcgcc atcggccgcc gtggagcgtt 12540 

cgcgtcgtct cgaacaggag gcggcaggtt tggcgaagtc gatgaccatc gacacgcgag 12600 

gaactatgac gaccaagaag cgaaaaaccg ccggcgagga cctggcaaaa caggtcagcg 12660 

aggccaagca ggccgcgttg ctgaaacaca cgaagcagca gatcaaggaa atgcagcttt 12720 

ccttgttcga tattgcgccrg tggccggaca cgatgcgagc gatgccaaac gacacggccc 12780 

gctctgccct gttcaccacg cgcaacaaga aaatcccgcg cgaggcgctg caaaacaagg 12840 

tcattttcca cgtcaacaag gacgtgaaga tcacctacac cggcgtcgag ctgcgggccg 12900 

acgatgacga actggtgtgg cagcaggtgt tggagtacgc gaagcgcacc cctatcggcg 12960 

agccgatcac cttcacgttc tacgagcttt gccaggacct gggctggtcg atcaatggcc 13020 

ggtattacac gaaggccgag gaatgcctgt cgcgcctaca ggcgacggcg atgggcttca 13080 

cgtccgaccg cgttgggcac ctggaatcgg tgtcgctgct gcaccgcttc cgcgtcctgg 13140 

accgtggcaa gaaaacgtcc cgttgccagg tcctgatcga cgaggaaatc gtcgtgctgt 13200 

ttgctggcga ccactacacg aaattcatat gggagaagta ccgcaagctg tcgccgacgg 13260 
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cccgacggat gttcgactat ttcagctcgc accgggagcc gtacccgctc aagctggaaa 13320 

ccttccgcct catgtgcgga tcggattcca cccgcgtgaa gaagtggcgc gagcaggtcg 13380 

gcgaagcctg cgaagagttg cgaggcagcg gcctggtgga acacgcctgg gtcaatgatg 134 40 

acctggtgca ttgcaaacgc tagggccttg tggggtcagt tccggctggg ggttcagcag 13500 

ccagcgcttt actggcattt caggaacaag cgggcactgc tcgacgcact tgcttcgctc 13560 

agtatcgctc gggacgcacg gcgcgctcta cgaactgccg ataaacagag gattaaaatt 13620 

gacaattgtg attaaggctc agattcgacg gcttggagcg gccgacgtgc aggatttccg 13680 

cgagatccga ttgtcggccc tgaagaaagc tccagagatg ttcgggtccg tttacgagca 13740 

cgaggagaaa aagcccatgg aggcgttcgc tgaacggttg cgagatgccg tggcattcgg 13800 

cgcctacatc gacggcgaga tcattgggct gtcggtcttc aaacaggagg acggccccaa 13860 

ggacgctcac aaggcgcatc tgtccggcgt tttcgtggag cccgaacagc gaggccgagg 13920 

ggtcgccggt atgctgctgc gggcgttgcc ggcgggttta ttgctcgtga tgatcgtccg 13980 

acagattcca acgggaatct ggtggatgcg catcttcatc ctcggcgcac ttaatatttc 14040 

gctattctgg agcttgttgt ttatttcggt ctaccgcctg ccgggcgggg tcgcggcgac 14100 

ggtaggcgct gtgcagccgc tgatggtcgt gttcatctct gccgctctgc taggtagccc 14160 

gatacgattg atggcggtcc tgggggctat ttgcggaact gcgggcgtgg cgctgttggt 14220 

gttgacacca aacgcagcgc tagatcctgt cggcgtcgca gcgggcctgg cgggggcggt 14280 

ttccatggcg ttcggaaccg tgctgacccg caagtggcaa cctcccgtgc ctctgctcac 14340 

ctttaccgcc tggcaactgg cggccggagg acttctgctc gttccagtag ctttagtgtt 14400 

tgatccgcca atcccgatgc ctacaggaac caatgttctc ggcctggcgt ggctcggcct 14460 
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gatcggagcg ggtttaacct acttcctttg gttccggggg atctcgcgac tcgaacctac 14520 

agttgtttcc ttactgggct ttctcagccc cagatctggg gtcgatcagc cggggatgca 14580 

tcaggccgac agtcggaact tcgggtcccc gacctgtacc attcggtgag caatggatag 14640 

gggagttgat atcgtcaacg ttcacttcta aagaaatagc gccactcagc ttcctcagcg 14700 

gctttatcca gcgatttcct attatgtcgg catagttctc aagatcgaca gcctgtcacg 14760 

gttaagcgag aaatgaataa gaaggctgat aattcggatc tctgcgaggg agatgatatt 14820 

tgatcacagg cagcaacgct ctgtcatcgt tacaatcaac atgctaccct ccgcgagatc 14880 

atccgtgttt caaacccggc agcttagttg ccgttcttcc gaatagcatc ggtaacatga 14940 

gcaaagtctg ccgccttaca acggctctcc cgctgacgcc gtcccggact gatgggctgc 15000 

ctgtatcgag tggtgatttt gtgccgagct gccggtcggg gagctgttgg ctggctggtg 15060 

gcaggatata ttgtggtgta aacaaattga cgcttagaca acttaataac acattgcgga 15120 

cgtttttaat gtactggggt ggtttttctt ttcaccagtg agacgggcaa cagctgattg 15180 

cccttcaccg cctggcccfg agagagttgc agcaagcggt ccacgctggt ttgccccagc 15240 

aggcgaaaat cctgtttgat ggtggttccg aaatcggcaa aatcccttat aaatcaaaag 15300 

aatagcccga gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga 15360 

acgtggactc caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg 15420 

aaccatcacc caaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc 15480 

ctaaagggag cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg 15540 

aagggaagaa agcgaaagga gcgggcgcca ttcaggctgc gcaactgttg ggaagggcga 15600 

tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc tgcaaggcga 15660 

ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac ggccagtgaa 15720 



BASF AG 

BASF NAE 877/03 



18/365 



January 08, 2004 



ttcgagctcg gtacccggg 



15739 



<210> 
<211> 
<212> 
<213> 

<220> 
<223> 



<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221> 
<222> 
<223> 

<220> 
<221>. 
<222> 
<223> 

<400> 



4 

11611 
DNA 

Artificial 
Vector 



mis cofeature 
(227) . . (227) 
n is a, c, g, or t 

misc_f eature 
(318) . . (318) 
n is a, c, g, or t 

misc_feature " 
(526) . . (526) 
n is a, c, g, or t 

mi sc_f eature 
(8946) . . (8946) 
n is a, c, g, or t 

misc_f eature 
(10028) . . (10028) 
n is a, c, g, or t 



agcttgcatg cctgcaggtc gagtggagat gtggagtggg cgcttacaca gtacacgagg 



60 
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acttctagct agaaagaagg attacctcta aacaagtgta cctgtgcatt ctgggtaaac 120 

gactcatagg agagttgtaa aaaagtttcg gccggcgtat tgggtgttac ggagcattca 180 

ctaggcaacc atgcatggtt actattgtat accatcttag taggaantga tttcgaggtt 240 

tatacctacg atgaatgtgt gtcctgtagg cttgagagtt caaggaagaa acatgcaatt 300 

atctttgcga acccaggngc tggtgacgga attttcatag tcaagctatc agagtaaaga 360 

agaggagcat gtcaaagtac aattagagac aaatatatag tcgcgtggag ccaagagcgg 420 

attcctcagt ctcgtaggtc tcttgacgac cgttgatctg cttgatctcg tctcccgaaa 480 

atgaaaatag ctctgctaag ctattcttct cttcgccgga gcctgnaagg cgttactagg 540 

ttgcagtcaa tgcattaatg cattgcagat gagctgtatc tggaagaggt aaacccgaaa 600 

acgcgtttta ttcttgttga catggagcta ttaaatcact agaaggcact ctttgctgct 660 

tggacaaatg aacgtatctt atcgagatcc tgaacaccat ttgtctcaac tccggagctg 720 

acatcgacac caacgatctt atatccagat tcgtcaagct gtttgatgat ttcagtaacg 780 

ttaagtggat cgatcccgcg gtcggcatct actctattcc tttgccctcg gacgagtgct 840 

ggggcgtcgg tttccactat cggcgagtac ttctacacag ccatcggtcc agacggccgc 900 

gcttctgcgg gcgatttgtg tacgcccgac agtcccggct ccggatcgga cgattgcgtc 960 

gcatcgaccc tgcgcccaag ctgcatcatc gaaattgccg tcaaccaagc tctgatagag 1020 

ttggtcaaga ccaatgcgga gcatatacgc ccggagccgc ggcgatcctg caagctccgg 1080 

atgcctccgc tcgaagtagc gcgtctgctg ctccatacaa gccaaccacg gcctccagaa 1140 

gaagatgttg gcgacctcgt attgggaatc cccgaacatc gcctcgctcc agtcaatgac 1200 

cgctgttatg cggccattgt ccgtcaggac attgttggag ccgaaatccg cgtgcacgag 1260 

gtgccggact tcggggcagt cctcggccca aagcatcagc tcatcgagag cctgcgcgac 1320 
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ggacgcactg acggtgtcgt ccatcacagt ttgccagtga tacacatggg gatcagcaat 1380 

cgcgcatatg aaatcacgcc atgtagtgta ttgaccgatt ccttgcggtc cgaatgggcc 1440 

gaacccgctc gtctggctaa gatcggccgc agcgatcgca tccatggcct ccgcgaccgg 1500 

ctgcagaaca gcgggcagtt cggtttcagg caggtcttgc aacgtgacac cctgtgcacg 1560 

gcgggagatg caataggtca ggctctcgct gaattcccca atgtcaagca cttccggaat 1620 

cgggagcgcg gccgatgcaa agtgccgata aacataacga tctttgtaga aaccatcggc 1680 

gcagctattt acccgcagga catatccacg ccctcctaca tcgaagctga aagcacgaga 1740 

ttcttcgccc tccgagagct gcatcaggtc ggagacgctg tcgaactttt cgatcagaaa 1800 

cttctcgaca gacgtcgcgg tgagttcagg catggtgatg tctgctcaag cggggtagct 1860 

gttagtcaag ctgcgatgaa gtgggaaagc tcgaactgaa aggttcaaag gaataaggga 1920 

tgggaaggat ggagtatgga tgtagcaaag tacttactta ggggaaataa aggttcttgg 1980 

atgggaagat gaatatactg aagatgggaa aagaaagaga aaagaaaaga gcagctggtg 2040 

gggagagcag gaaaatatgg caacaaatgt tggactgacg caacgacctt gtcaaccccg 2100 

ccgacacacc gggcggacag acggggcaaa gctgcctacc agggactgag ggacctcagc 2160 

aggtcgagtg cagagcaccg gatgggtcga ctgccagctt gtgttcccgg tctgcgccgc 2220 

tggccagctc ctgagcggcc tttccggttt catacaccgg gcaaagcagg agaggcacga 2280 

tatttggacg ccctacagat gccggatggg ccaattaggg agcttacgcg ccgggtactc 2340 

gctctaccta cttcggagaa ggtactatct cgtgaatctt ttaccagatc ggaagcaatt 2400 

ggacttctgt acctaggtta atggcatgct atttcgccga cggctataca cccctggctt 24 60 

cacattctcc ttcgcttact gccggtgatt cgatgaagct ccatattctc cgatgatgca 2520 
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atagattctt ggtcaacgag gggcacacca gcctttccac ttcggggcgg aggggcggcc 2580 

ggtcccggat taataatcat ccactgcacc tcagagccgc cagagctgtc tggccagtgg 2640 

cttattactc agcccttctc tctgcgtccg tccgtctctc cgcatgccag aaagagtcac 2700 

cggtcactgt acagagctca cgagttcgtc acatttttct acaaatggtg gaggcggcgg 2760 

attttaggct caagtcatga ccctctgggt cactccagaa tcagctaggt caacgaataa 2820 

ggatgattct ataggaagat ccaggcaccg gtcaaccatg atctggacag atttgggagc 2880 

tcggtataag ctctccacct atcttattct gtatagttta ggcttaaagt ttatccagga 2940 

gatgttgctg aagtcgattt gagtccactt cctcactggt agctatacga ctttgatggt 3000 

cgttgtaggg gctgtattag gtctcgatca aacacaaata gaattaaatg gtactcgagt 3060 
ccactgaagg tggcttctcc gtcttccgta gccgtgccga aatccttaca gcttgtgttg . 3120 

tgtgactttt ggttacgccg tctgactttt gtggtgagct aactagagat catgctatat 3180 

ctcctgattt aatacaatgc tcatcataac attccacctg gaactgctag caacgtttga 3240 

cttgcattgt gcaacgccct ttgcagagct atcggatgat caatagtgcc acgttctaaa 3300 

ttcaaccaac gcaggtgccc caagccttcg acatccggat gtatttcgaa aacctcatgg 3360 

cgattgcagt cctcagattc atgttcattc caatgctcat tggtgaataa aaggttcaca 3420 

gggaataagt tcaaactcga gatacttgag aatattgaaa gccaaaggac cctctatgct 3480 

ccaagctaga gtctcagcct ggaaagcaaa tccaaatgaa gctatgctac ctccaattcc 3540 

tcatcatctt atctataata cagagtcgaa gaatatcctc ttgacaccgc tccgtcctcc 3600 

gacttcaata aggagcttac tcctccttga caccacccct ccagttcttc tcggcgttct 3660 

ggagggaggc cttgtcggtc ttgggctggc cctggctgag aaagctgttg gcagccttaa 3720 
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agggacgctg gaggtcacca gtcgctggct 

aaagcgcctt cagcggatga tcgactggat 

atgcatgacg gccatcatgc caacgcccat 

gcccttatcg tactccatgt tggtagttgt 

cggagcaggc tcgacgtatt tcagtgtcga 

gtttcgcatg attgaacaag atggattgca 

gctattcggc tatgactggg cacaacagac 

gctgtcagcg caggggcgcc cggttctttt 

tgaactgcag gacgaggcag cgcggctatc 

agctgtgctc gacgttgtca ctgaagcggg 

ggggcaggat ctcctgtcat ctcaccttgc 

tgcaatgcgg cggctgcata cgcttgatcc 

acatcgcatc gagcgagca-c gtactcggat 

ggacgaagag catcaggggc tcgcgccagc 

gcccgacggc gaggatctcg tcgtgaccca 

ggaaaatggc cgcttttctg gattcatcga 

tcaggacata gcgttggcta cccgtgatat 

ccgcttcctc gtgctttacg gtatcgccgc 

ccttcttgac gagttcttct gagcgggact 

cccaacctgc catcacgaga tttcgattcc 
ggaatcgttt tccgggacgc cggctggatg 
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tcccgaagac gtggatctta accagattcg 3780 

cagaagagcg ttggtgtact tgaagtacag 3840 

gaactggctc ttaatgagct ggcggaactg 3900 

gacaggacga ggctcctcgc cgcttccaag 3960 

aagatctgat caagagacag gatgaggatc 4020 

cgcaggttct ccggccgctt gggtggagag 4080 

aatcggctgc tctgatgccg ccgtgttccg 4140 

tgtcaagacc gacctgtccg gtgccctgaa 4200 

gtggctggcc acgacgggcg ttccttgcgc 4260 

aagggactgg ctgctattgg gcgaagtgcc 4320 

tcctgccgag aaagtatcca tcatggctga 4380 

ggctacctgc ccattcgacc accaagcgaa 4440 

ggaagccggt cttgtcgatc aggatgatct 4500 

cgaactgttc gccaggctca aggcgcgcat 4560 

tggcgatgcc tgcttgccga atatcatggt 4620 

ctgtggccgg ctgggtgtgg cggaccgcta 4 680 

tgctgaagag cttggcggcg aatgggctga 4740 

tcccgattcg cagcgcatcg ccttctatcg 4800 

ctggggttcg aaatgaccga ccaagcgacg 4860 

accgccgcct tctatgaaag gttgggcttc 4920 

atcctccagc gcggggatct catgctggag 4980 
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ttcttcgccc accccgggct cgatcccctc gcgagttggt tcagctgctg cctgaggctg 5040 

gacgacctcg cggagttcta ccggcagtgc aaatccgtcg gcatccagga aaccagcagc 5100 

ggctatccgc gcatccatgc ccccgaactg caggagtggg gaggcacgat ggccgctttg 5160 

gtccggatct ttgtgaagga accttacttc tgtggtgtga cataattgga caaactacct 5220 

acagagattt aaagctctaa ggtaaatata aaatttttaa gtgtataatg tgttaaacta 5280 

ctgattctaa ttgtttgtgt attttagatt ccaacctatg gaactgatga atgggagcag 5340 

tggtggaatg cctttaatga ggaaaacctg ttttgctcag aagaaatgcc atctagtgat 5400 

gatgaggcta ctgctgactc tcaacattct actcctccaa aaaagaagag aaaggtagaa 5460 

gaccccaagg actttccttc agaattgcta agttttttga gtcatgctgt gtttagtaat 5520 

agaactcttg cttgctttgc tatttacacc acaaaggaaa aagctgcact gctatacaag 5580 

aaaattatgg aaaaatattc tgtaaccttt ataagtaggc ataacagtta taatcataac 5640 

atactgtttt ttcttactcc acacaggcat agagtgtctg ctattaataa ctatgctcaa 5700 

aaattgtgta cctttagctt tttaatttgt aaaggggtta ataaggaata tttgatgtat 5760 

agtgccttga ctagagatca taatcagcca taccacattt gtagaggttt tacttgcttt 5820 

aaaaaacctc ccacacctcc ccctgaacct gaaacataaa atgaatgcaa ttgttgttgt 5880 

taacttgttt attgcagctt ataatggtta caaataaagc aatagcatca caaatttcac 5940 

aaataaagca tttttttcac tgcattctag ttgtggtttg tccaaactca tcaatgtatc 6000 

ttatcatgtc tggatctgac gggtgcgcat gatcgtgctc ctgtcgttga ggacccggct 6060 

aggctggcgg ggttgcctta ctggttagca gaatgaatca ccgatacgcg agcgaacgtg 6120 

aagcgactgc tgctgcaaaa cgtctgcgac ctgagcaaca acatgaatgg tcttcggttt 6180 
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ccgtgtttcg taaagtctgg aaacgcggaa gtcagcgctc ttccgcttcc tcgctcactg 6240 

actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca • aaggcggtaa 6300 

tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca aaaggccagc 6360 

aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag 6420 

gctccgcccc cctgacgagc atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc 6480 

gacaggacta taaagatacc aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt 6540 

tccgaccctg ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct 6600 

ttctcatagc tcacgctgta ggtatctoag ttcggtgtag gtcgttcgct ccaagctggg 6660 

ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttatccggta actatcgtct 6720 

tgagtccaac ccggtaagac acgacttatc gccactggca gcagccactg gtaacaggat 6780 

tagcagagcg aggtatgtag gcggtgctac agagttcttg aagtggtggc ctaactacgg 6840 

ctacactaga aggacagtat ttggtatctg cgctctgctg aagccagtta ccttcggaaa 6900 

aagagttggt agctcttgat ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt 6960 

ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc 7020 

tacggggtct gacgctcagt ggaacgaaaa ctcacgttaa gggattttgg tcatgagatt 7080 

atcaaaaagg atcttcacct agatcctttt aaattaaaaa tgaagtttta aatcaatcta 7140 

aagtatatat gagtaaactt ggtctgacag ttaccaatgc ttaatcagtg aggcacctat 7200 

ctcagcgatc tgtctatttc gttcatccat agttgcctga ctccccgtcg tgtagataac 7260 

tacgatacgg gagggcttac catctggccc cagtgctgca atgataccgc gagacccacg 7320 

ctcaccggct ccagatttat cagcaataaa ccagccagcc ggaagggccg agcgcagaag 7380 
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tggtcctgca actttatccg cctccatcca gtctattaat tgttgccggg aagctagagt 7440 

aagtagttcg ccagttaata gtttgcgcaa cgttgttgcc attgctgcag gcatcgtggt 7500 

gtcacgctcg tcgtttggta tggcttcatt cagctccggt tcccaacgat caaggcgagt 7560 

tacatgatcc cccatgttgt gcaaaaaagc ggttagctcc ttcggtcctc cgatcgttgt 7620 

cagaagtaag ttggccgcag tgttatcact catggttatg gcagcactgc ataattctct 7680 

tactgtcatg ccatccgtaa gatgcttttc tgtgactggt gagtactcaa ccaagtcatt 7740 

ctgagaatag tgtatgcggc gaccgagttg ctcttgcccg gcgtcaacac gggataatac 7800 

cgcgccacat agcagaactt taaaagtgct catcattgga aaacgttctt cggggcgaaa 7860 

actctcaagg atcttaccgc tgttgagatc cagttcgatg taacccactc gtgcacccaa 7920 

ctgatcttca gcatctttta ctttcaccag cgtttctggg tgagcaaaaa caggaaggca 7980 

aaatgccgca aaaaagggaa taagggcgac acggaaatgt tgaatactca tactcttcct . 8040 

ttttcaatat tattgaagca tttatcaggg ttattgtctc atgagcggat acatatttga 8100 

atgtatttag aaaaataaac aaataggggt tccgcgcaca tttccccgaa aagtgccacc 8160 

tgacgtctaa gaaaccatta ttatcatgac attaacctat aaaaataggc gtatcacgag 8220 

gccctttcgt cttcaagaat tcgcggccgc aattaaccct cactaaagga tccctatagt 8280 

gagtcgtatt atgcggccgc gaattctcat gtttgaccgc ttatcatcga taagctctgc 8340 

tttttgttga cttccattgt tcattccacg gacaaaaaca gagaaaggaa acgacagagg 8400 

ccaaaaagct cgctttcagc acctgtcgtt tcctttcttt tcagagggta ttttaaataa 84 60 

aaacattaag ttatgacgaa gaagaacgga aacgccttaa accggaaaat tttcataaat 8520 

agcgaaaacc cgcgaggtcg ccgccccgta acaaggcgga tcgccggaaa ggacccgcaa 8580 

atgataataa ttatcaattg catactatcg acggcactgc tgccagataa caccaccggg 8640 
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gaaacattcc atcatgatgg ccgtgcggac ataggaagcc agttcatcca tcgctttctt 8700 

gtctgctgcc atttgctttg tgacatccag cgccgcacat tcagcagcgt ttttcagcgc 8760 

gttttcgatc aacgtttcaa tgttggtatc aacaccaggt ttaactttga acttatcggc 8820 

actgacggtt accttgttct gcgctggctc atcacgcagg ataccaaggc tgatgttgta 8880 

gatattggtc accggctgag ggttttcgat tgccgctgcg tggatagcac catttgcgat 8940 

caggcngtcc ttgatgaatg acactccatt gcgaataagt tcgaaggaga cggtgtcacg 9000 

aatgcgctgg tccagctcgg tcgattgcct tttgtgcagc agaggtatca atctcaacgc 9060 

caaggctcat cgaagcgcaa tattgctgct caccaaaacg cgtattgacc aggtgttcaa 9120 

cggcaaattt ctgcccttct gatgtcagaa aggcaaagtg attttctttc tggtattcag 9180 

ttgctgtgtg tcggtttcag caaaaccaag ctcgcgcaat tcggctgtgc agatttagaa 9240 

ggcagatcac cagacagcaa cggccaacgg aaaacagcgc atacagaaca tccgtcgccg 9300 

cgccgacaac gtgataattt ttatgaccca tgatttattt ccttttagac gtgagcctgt 9360 

cgcacagcaa agccgccgaa agttcctcga agctagcttc agacgtgtct agatacgtct 9420 

gctttttgtt gacttccatt gttcattcca cggacaaaaa cagagaaagg aaacgacaga 9480 

ggccaaaaag ctcgctttca gcacctgtcg tttcctttct tttcagaggg tattttaaat 9540 

aaaaacatta agttatgacg aagaagaacg gaaacgcctt aaaccggaaa attttcataa 9600 

atagcgaaaa cccgcgaggt cgccgccccg taacaaggcg gatcgccgga aaggacccgc 9660 

aaatgataat. aattatcaat tgcatactat cgacggcact gctgccagat aacaccaccg 9720 

gggaaacatt ccatcatgat ggccgtgcgg acataggaag ccagttcatc catcgctttc 9780 

ttgtctgctg ccatttgctt tgtgacatcc agcgccgcac attcagcagc gtttttcagc 9840 
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gcgttttcga tcaacgtttc aatgttggta tcaacaccag gtttaacttt gaacttatcg 9900 



gcactgacgg ttaccttgtt ctgcgctggc tcatcacgca ggataccaag gctgatgttg 9960 

tagatattgg tcaccggctg agggttttcg attgccgctg cgtggatagc accatttgcg 10020 

atcaggcngt ccttgatgaa tgacactcca ttgcgaataa gttcgaagga gacggtgtca 10080 

cgaatgcgct ggtccagctc ggtcgattgc cttttgtgca gcagaggtat caatctcaac 10140 

gccaaggctc atcgaagcgc aatattgctg ctcaccaaaa cgcgtattga ccaggtgttc 10200 

aacggcaaat ttctgccctt ctgatgtcag aaaggcaaag tgattttctt tctggtattc 10260 

agttgctgtg tgtcggtttc agcaaaacca agctcgcgca attcggctgt gcagatttag 10320 

aaggcagatc accagacagc aacggccaac ggaaaacagc gcatacagaa catccgtcgc 10380 

cgcgccgaca acgtgataat ttttatgacc catgatttat ttccttttag acgtgagcct 10440 

gtcgcacagc aaagccgccg aaagttcctc gaccgatgcc cttgagagcc ttcaacccag 10500 

tcagctcctt ccggtgggcg cggggcatga ctatcgtcgc cgcacttatg actgtcttct 10560 

ttatcatgca actcgtagga caggtgccgg cagcgctctg ggtcattttc ggcgaggacc 10620 

gctttcgctg gagcgcgacg atgatcggcc tgtcgcttgc ggtattcgga atcttgcacg 10680 

ccctcgctca agccttcgtc actggtcccg ccaccaaacg tttcggcgag aagcaggcca 10740 

ttatcgccgg catggcggcc gacgcgctgg gctacgtctt gctggcgttc gcgacgcgag 10800 

gctggatggc cttccccatt atgattcttc tcgcttccgg cggcatcggg atgcccgcgt 10860 

tgcaggccat gctgtccagg caggtagatg acgaccatca gggacagctt caaggatcgc 10920 

tcgcggctct taccagccta acttcgatca ttggaccgct gatcgtcacg gcgatttatg 10980 

ccgcctcggc gagcacatgg aacgggttgg catggattgt aggcgccgcc ctataccttg 11040 



BASF AG 28/365 January 08, 2004 

BASF NAE 877/03 

tctgcctccc cgcgttgcgt cgcggtgcat ggagccgggc cacctcgacc tgaatggaag 11100 

ccggcggcac ctcgctaacg gattcaccac tccaagaatt ggagccaatc aattcttgcg 11160 

gagaactgtg aatgcgcaaa ccaacccttg gcagaacata tccatcgcgt ccgccatctc 11220 

cagcagccgc acgcggcgca tctcgggcag cgttgggtcc tgcagatccg gctgtggaat 11280 

gtgtgtcagt tagggtgtgg aaagtcccca ggctccccag caggcagaag tatgcaaagc 11340 

atgcatctca attagtcagc aaccaggtgt ggaaagtccc caggctcccc agcaggcaga 11400 

agtatgcaaa gcatgcatct caattagtca gcaaccatag tcccgcccct aactccgccc 11460 

atcccgcccc taactccgcc cagttccgcc cattctccgc cccatggctg actaattttt 11520 

tttatttatg cagaggccga ggccgcctcg gcctctgagc tattccagaa gtagtgagga 11580 

ggcttttttg gaggcctagg cttttgcaaa a 11611 



<210> 5 

<211> 21 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 5 

cgatgtagga gggcgtggat a 21 



<210> 6 

<211> 21 

<212> DNA 

<213> Artificial 



<220> 

<223> Primer 
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<400> 



6 



gcttctgcgg gcgatttgtg t 



21 



<210> 7 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 7 

tgagaatatc accggaattg 20 

<210> 8 

<211> 21 

<212> DNA 

<213> Artificial . 

<220> 

<223> Primer 



<210> 9 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 9 

gtgaatggaa atcccatcgc tgtc 24 



<400> 



8 



agctcgacat actgttcttc c 



21 



<210> 10 
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<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 10 

.agtgggtact ctaaaggcca tacc 24 



<210> 11 
<211> 1771 
<212> DNA 

<213> Haematococcus pluvialis 
<220> 

<221> CDS 

<222> (166) . . (1155) 
<400> 11 

ggcacgagct tgcacgcaag tcagcgcgcg caagtcaaca cctgccggtc cacagcctca 60 

aataataaag agctcaagcg tttgtgcgcc tcgacgtggc cagtctgcac tgccttgaac 120 

ccgcgagtct cccgccgcac tgactgccat agcacagcta gacga atg cag eta gca 177 

Met Gin Leu Ala 
1 

gcg aca gta atg ttg gag cag ctt acc gga age get gag gca etc aag 225 
Ala Thr Val Met Leu Glu Gin Leu Thr Gly Ser Ala Glu Ala Leu Lys 
5 10 15 20 

gag aag gag aag gag- gtt gca ggc age tct gac gtg ttg cgt aca tgg 273 
Glu Lys Glu Lys Glu Val Ala Gly Ser Ser Asp Val Leu Arg Thr Trp 
25 30 35 



gcg acc cag tac teg ctt ccg tea gaa gag tea gac gcg gee cgc ccg 
Ala Thr Gin Tyr Ser Leu Pro Ser Glu Glu Ser Asp Ala Ala Arg Pro 
40 45 50 



321 
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gga ctg aag 
Gly Leu Lys 
55 

aca atg gcg 
Thr Met Ala 
70 

gcc att ttt 
Ala lie Phe 
85 

ctg ccc gtg 
Leu Pro Val 

ctg etc gac 
Leu Leu Asp 

ggc ctt ttt 
Gly Leu Phe 
135 

aga aac agg 
Arg Asn Arg 
150 

tac gcc tgg 
Tyr Ala Trp 
165 

cac aac cac 
His Asn His 

aac cct ggc 
Asn Pro Gly 



aat gcc tac 
Asn Ala Tyr 

eta cgt gtc 
Leu Arg Val 

caa ate aag 
Gin lie Lys 
90 

tea gat gcc 
Ser Asp Ala 
105 

ate gtc gta 
He Val Val 
120 

ate acc acg 
He Thr Thr 

cag ctt aat 
Gin Leu Asn 

ttt gat tac 
Phe Asp Tyr 
170 

act ggc gag 
Thr Gly Glu 
185 

att gtg ccc 
He Val Pro 
200 



aag cca cca 
Lys Pro Pro 
60 

ate ggc tec 
He Gly Ser 
75 

ctt ccg acc 
Leu Pro Thr 



aca get cag 
Thr Ala Gin 



gta ttc ttt 
Val Phe Phe 
125 

cat gat get 
His Asp Ala 
140 

gac ttc ttg 
Asp Phe Leu 
155 

aac atg ctg 
Asn Met Leu 

gtg ggc aag 
Val Gly Lys 



tgg ttt gcc 
Trp Phe Ala 
205 



cct tec gac 
Pro Ser Asp 

tgg gcc gca 
Trp Ala Ala 
80 

tec ttg gac 
Ser Leu Asp 
95 

ctg gtt age 
Leu Val Ser 
110 

gtc ctg gag 
Val Leu Glu 



atg cat ggc 
Met His Gly 

ggc aga gta 
Gly Arg Val 
160 

cac cgc aag 
His Arg Lys 
175 

gac cct gac 
Asp Pro Asp 
190 

age ttc atg 
Ser Phe Met 



aca aag ggc 
Thr Lys Gly 
65 

gtg ttc etc 
Val Phe Leu 



cag ctg cac 
Gin Leu His 



ggc acg age 
Gly Thr Ser 
115 

ttc ctg tac 
Phe Leu Tyr 
130 

acc ate gcc 
Thr He Ala 
145 

tgc ate tec 
Cys He Ser 

cat tgg gag 
His, Trp Glu 

ttc cac agg 
Phe His Arg 
195 

tec age tac 
Ser Ser Tyr 
210 



ate 369 
He 



cac 4 17 

His 

tgg 4 65 

Trp 

100 

age 513 
Ser 

aca 561 
Thr 



atg 609 
Met 

ttg 657 
Leu 

cac 705 

His 

180 

gga 753 
Gly 

atg 801 
Met 
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teg atg tgg cag ttt gcg cgc etc gca tgg tgg acg gtg gtc atg cag 849 
Ser Met Trp Gin Phe Ala Arg Leu Ala Trp Trp Thr Val Val Met Gin 
215 220 225 



ctg ctg ggt gcg cca atg gcg aac ctg ctg gtg ttc atg gcg gee gcg 897 
Leu Leu Gly Ala Pro Met Ala Asn Leu Leu Val Phe Met Ala Ala Ala 
230 235 240 

ccc ate ctg tec gee ttc cgc ttg ttc tac ttt ggc acg tac atg ccc 945 
Pro lie Leu Ser Ala Phe Arg Leu Phe Tyr Phe Gly Thr Tyr Met Pro 
245 250 255 260 



cac aag cct gag cct ggc gee gcg tea ggc tct tea cca gee gtc atg 993 
His Lys Pro Glu Pro Gly Ala Ala Ser Gly Ser Ser Pro Ala Val Met 
265 270 275 

aac tgg tgg aag teg cgc act age cag gcg tec gac ctg gtc age ttt 1041 
Asn Trp Trp Lys Ser Arg Thr Ser Gin Ala Ser Asp Leu Val Ser Phe 
280 285 290 



ctg ace tgc tac cac ttc gac ctg cac tgg gag cac cac cgc tgg ccc 1089 
Leu Thr Cys Tyr His Phe Asp Leu His Trp Glu His His Arg Trp Pro 
295 300 305 



ttc gee ccc tgg tgg -gag ctg ccc aac tgc cgc cgc ctg tct ggc cga 1137 
Phe Ala Pro Trp Trp Glu Leu Pro Asn Cys Arg Arg Leu Ser Gly Arg 
310 315 320 



ggt ctg gtt cct gee tag ctggacacac tgcagtgggc cctgctgcca 1185 

Gly Leu Val Pro Ala 

325 



gctgggcatg caggttgtgg caggactggg tgaggtgaaa agetgeagge gctgctgccg 1245 
gacacgctgc atgggctacc ctgtgtagct gccgccacta ggggaggggg tttgtagctg 1305 
tegagcttge cccatggatg aagctgtgta gtggtgcagg gagtacaccc acaggccaac 1365 



acccttgcag gagatgtctt gegtegggag gagtgttggg cagtgtagat gctatgattg 1425 



tatcttaatg ctgaagcett taggggagcg acacttagtg ctgggcaggc aacgccctgc 1485 
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aaggtgcagg cacaagctag gctggacgag gactcggtgg caggcaggtg aagaggtgcg 1545 

ggagggtggt gccacaccca ctgggcaaga ccatgctgca atgctggcgg tgtggcagtg 1605 

agagctgcgt gattaactgg gctatggatt gtttgagcag tctcacttat tctttgatat 1665 

agatactggt caggcaggtc aggagagtga gtatgaacaa gttgagaggt ggtgcgctgc 1725 

ccctgcgctt atgaagctgt aacaataaag tggttcaaaa aaaaaa 1771 

<210> 12 
<211> 329 
<212> PRT 

<213> Haematococcus pluvialis 
<400> 12 

Met Gin Leu Ala Ala Thr Val Met Leu Glu Gin Leu Thr Gly Ser Ala 
15 10 15 



Glu Ala Leu Lys Glu Lys Glu Lys Glu Val Ala Gly Ser Ser Asp Val 
20 - 25 30 



Leu Arg Thr Trp Ala Thr Gin Tyr Ser Leu Pro Ser Glu Glu Ser Asp 
35 40 45 



Ala Ala Arg Pro Gly Leu Lys Asn Ala Tyr Lys Pro Pro Pro Ser Asp 
50 55 60 



Thr Lys Gly lie Thr Met Ala Leu Arg Val lie Gly Ser Trp Ala Ala 
65 70 75 80 



Val Phe Leu His Ala lie Phe Gin lie Lys Leu Pro Thr Ser Leu Asp 
85 90 95 



BASF AG 

BASF NAE 877/03 



34/365 



January 08, 2004 



Gin Leu His Trp 
100 



Gly Thr Ser Ser 
115 



Phe Leu Tyr Thr 
130 



Thr lie Ala Met 
145 



Cys lie Ser Leu 



His Trp Glu His 
180 



Phe His Arg Gly 
195 



Ser Ser Tyr Met 
210 



Val Val Met Gin 
225 



Leu Pro Val Ser 



Leu Leu Asp lie 
120 



Gly Leu Phe lie 
135 



Arg Asn Arg Gin 
150 



Tyr Ala Trp Phe 
165 



His Asn His Thr 



Asn Pro Gly lie 
200 



Ser Met Trp Gin 
215 



Leu Leu Gly Ala 
230 



Asp Ala Thr Ala 
105 



Val Val Val Phe 



Thr Thr His Asp 
140 



Leu Asn Asp Phe 
155 



Asp Tyr Asn Met 
170 



Gly Glu Val Gly 
185 



Val Pro Trp Phe 



Phe Ala Arg Leu 

220 



Pro Met Ala Asn 

235 



Gin Leu Val Ser 
110 



Phe Val Leu Glu 
125 



Ala Met His Gly 



Leu Gly Arg Val 
160 



Leu His Arg Lys 
175 



Lys Asp Pro Asp 
190 



Ala Ser Phe Met 
205 



Ala Trp Trp Thr 



Leu Leu Val Phe 
240 



Met Ala Ala Ala Pro lie Leu Ser Ala Phe Arg Leu Phe Tyr Phe Gly 
245 250 255 
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Thr Tyr Met Pro His Lys Pro Glu Pro Gly Ala Ala Ser Gly Ser Ser 



260 



265 



270 



Pro Ala Val Met Asn Trp Trp Lys Ser. Arg Thr Ser Gin Ala Ser Asp 



275 



280 



285 



Leu Val Ser Phe Leu Thr Cys Tyr His Phe Asp Leu His Trp Glu His 



290 



295 



300 



His Arg Trp Pro Phe Ala Pro Trp Trp Glu Leu Pro Asn Cys Arg Arg 



305 



310 



315 



320 



Leu Ser Gly Arg Gly Leu Val Pro Ala 
325. 



<210> 13 
<211> 1662 
<212> DNA 

<213> Haematococcus pluvialis 
<220> 

<221> CDS 

<222> (168) . . (1130) 
<400> 13 

cggggcaact caagaaattc aacagctgca agcgcgcccc agcctcacag cgccaagtga 60 
gctatcgacg tggttgtgag cgctcgacgt ggtccactga cgggcctgtg agcctctgcg 120 
ctccgtcctc tgccaaatct ' cgcgtcgggg cctgcctaag tcgaaga atg cac gtc 176 



Met His Val 



1 
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gca teg gca 
Ala Ser Ala 
5 

age cca gac 
Ser Pro Asp 
20 

gag teg tea 
Glu Ser Ser 

cca gca tct 
Pro Ala Ser 



ace tgg ace 
Thr Trp Thr 
70 

aca tec atg 
Thr Ser Met 
85 

cag ctt ttg 
Gin Leu Leu 
100 

att gta ctt 
He Val Leu 



gca atg cat 
Ala Met His 



ctt ggc aac 
Leu Gly Asn 
150 

ctg cat cgc 



eta atg gtc 
Leu Met Val 

gtc ttg aga 
Val Leu Arg 
25 

gac gca get 
Asp Ala Ala 
40 

gac gee aag 
Asp Ala Lys 
55 

gca gtg ttt 
Ala Val Phe 

gac cag ctt 
Asp Gin Leu 

ggc gga "age 
Gly Gly Ser 
105 

gag ttc ctg 
Glu Phe Leu 
120 

ggc ace ata 
Gly Thr He 
135 

ate tgc ata 
He Cys He 

aag cac tgg 



gag cag aaa 
Glu Gin Lys 
10 

gcg tgg gcg 
Ala Trp Ala 

cgt cct gcg 
Arg Pro Ala 

ggc ate acg 
Gly He Thr 
60 

tta cac gca 
Leu His Ala 
75 

cac tgg ttg 
His Trp Leu 
90 

age age eta 
Ser Ser Leu 

tac act ggt 
Tyr Thr Gly 

get ttg agg 
Ala Leu Arg 
140 

tea ctg tac 
Ser Leu Tyr 
155 

gag cac cac 



ggc agt gag 
Gly Ser Glu 
15 

aca cag tat 
Thr Gin Tyr 
30 

eta aag cac 
Leu Lys His 
45 

atg gcg ctg 
Met Ala Leu 



ata ttt caa 
He Phe Gin 



cct gtg tec 
Pro Val Ser 
95 

ctg cac ate 
Leu His He 
110 

eta ttc ate 
Leu Phe lie 
125 

cac agg cag 
His Arg Gin 

gee tgg ttt 
Ala Trp Phe 

aac cat act 



gca get get 
Ala Ala Ala 

cac atg cca 
His Met Pro 

gee tac aaa 
Ala Tyr Lys 
50 

ace ate att 
Thr He He 
65 

ate agg eta 
He Arg Leu 
80 

gaa gee aca 
Glu Ala Thr 



get gca gtc 
Ala Ala Val 



ace aca cat 
Thr Thr His 
130 

etc aat gat 
Leu Asn Asp 
145 

gac tac age 
Asp Tyr Ser 
160 

ggc gaa gtg 



tec 224 
Ser 



tec 272 

Ser 

35 

cct 320 
Pro 



ggc 368 
Gly 

ccg 416 
Pro 



gee 4 64 

Ala 



ttc 512 

Phe 

115 

gac 560 
Asp 

etc 608 
Leu 

atg 656 
Met 

ggg 704 
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Leu His Arg Lys His Trp Glu His His Asn His Thr Gly Glu Val Gly 
165 170 175 

aaa gac cct gac ttc cac aag gga aat ccc ggc ctt gtc ccc tgg ttc 752 
Lys Asp Pro Asp Phe His Lys Gly Asn Pro Gly Leu Val Pro Trp Phe 
180 185 190 195 

gcc age ttc atg tec age tac atg tec ctg tgg cag ttt gee egg ctg 800 
Ala Ser Phe Met Ser Ser Tyr Met Ser Leu Trp Gin Phe Ala Arg Leu 
200 205 210 

gca tgg tgg gca gtg gtg atg caa atg ctg ggg gcg ccc atg gca aat 848 
Ala Trp Trp Ala Val Val Met Gin Met Leu Gly Ala Pro Met Ala Asn 
215 220 225 

etc eta gtc ttc atg get gca gcc cca ate ttg tea gca ttc cgc etc 8 96 

Leu Leu Val Phe Met Ala Ala Ala Pro lie Leu Ser Ala Phe Arg Leu 
230 235 240 

ttc tac ttc ggc act tac ctg cca cac aag cct gag cca ggc cct gca 944 
Phe Tyr Phe Gly Thr Tyr Leu Pro His Lys Pro Glu Pro Gly Pro Ala 
245 250 255 

r 

gca ggc tct cag gtg atg gcc tgg ttc agg gcc aag aca agt gag gca 992 

Ala Gly Ser Gin Val 'Met Ala Trp Phe Arg Ala Lys Thr Ser Glu Ala 

260 265 270 275 

tct gat gtg atg agt ttc ctg aca tgc tac cac ttt gac ctg cac tgg 1040 
Ser Asp Val Met Ser Phe Leu Thr Cys Tyr His Phe Asp Leu His Trp 
280 285 290 

gag cac cac agg tgg ccc ttt gcc ccc tgg tgg cag ctg ccc cac tgc 1088 
Glu His His Arg Trp Pro Phe Ala Pro Trp Trp . Gin Leu Pro His Cys 
295 300 305 

cgc cgc ctg tec ggg cgt ggc ctg gtg cct gcc ttg gca tga 1130 
Arg Arg Leu Ser Gly Arg Gly Leu Val Pro. Ala Leu Ala 
310 315 320 



cctggtccct ccgctggtga cccagcgtct gcacaagagt gtcatgetae agggtgctgc 1190 
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ggccagtggc agcgcagtgc actctcagcc tgtatggggc taccgctgtg ccactgagca 1250 

ctgggcatgc cactgagcac tgggcgtgct actgagcaat gggcgtgcta ctgagcaatg 1310 

ggcgtgctac tgacaatggg cgtgctactg gggtctggca gtggctagga tggagtttga 1370 

tgcattcagt agcggtggcc aacgtcatgt ggatggtgga agtgctgagg ggtttaggca 1430 

gccggcattt gagagggcta agttataaat cgcatgctgc tcatgcgcac atatctgcac 14 90 

acagccaggg aaatcccttc gagagtgatt atgggacact tgtattggtt tcgtgctatt 1550 

gttttattca gcagcagtac ttagtgaggg tgagagcagg gtggtgagag tggagtgagt 1610 

gagtatgaac ctggtcagcg aggtgaacag cctgtaatga atgactctgt ct 1662 



<210> 14 
<211> 320 
<212> PRT 

<213> Haematococcus pluvialis 
<400> 14 

Met His Val Ala Ser -Ala Leu Met Val Glu Gin Lys Gly Ser Glu Ala 
15 10 15 



Ala Ala Ser Ser Pro Asp Val Leu Arg Ala Trp Ala Thr Gin Tyr His 
20 25 30 



Met Pro Ser Glu Ser Ser Asp Ala Ala Arg Pro Ala Leu Lys His Ala 
35 40 45 



Tyr Lys Pro Pro Ala Ser Asp Ala Lys Gly lie Thr Met Ala Leu Thr 
50 55 60 



He He Gly. Thr Trp Thr Ala Val Phe Leu His 



Ala He Phe Gin He 
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65 



Arg Leu Pro Thr 

Ala Thr Ala Gin 
100 

Ala Val Phe lie 
115 



Thr His Asp Ala 
130 



Asn Asp Leu Leu 
145 



Tyr Ser Met Leu 



Glu Val Gly Lys 
180 



Pro Trp Phe Ala 
195 



Ala Arg Leu Ala 
210 



Met Ala Asn Leu 



70 



Ser Met Asp Gin 
85 



Leu Leu Gly Gly 



Val Leu Glu Phe 
120 



Met His Gly Thr 
135 



Gly Asn lie Cys 
150 



His "Arg Lys His 
165 



Asp Pro Asp Phe 



Ser Phe Met Ser 
200 



Trp Trp Ala Val 
215 



Leu Val Phe Met 



75 



Leu His Trp Leu 
90 

Ser Ser Ser Leu 
105 

Leu Tyr Thr Gly 



lie Ala Leu Arg 
140 



lie Ser Leu Tyr 
155 



Trp Glu His His 
170 



His Lys Gly Asn 
185 



Ser Tyr Met Ser 



Val Met Gin Met 
220 



Ala Ala Ala Pro 



80 



Pro Val Ser Glu 
95 

Leu His lie Ala 
110 

Leu Phe lie Thr 
125 



His Arg Gin Leu 



Ala Trp Phe Asp 
160 



Asn His Thr Gly 
175 



Pro Gly Leu Val 
190 



Leu Trp Gin Phe 
205 



Leu Gly Ala Pro 



lie Leu Ser Ala 
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225 230 235 240 



Phe Arg Leu Phe Tyr Phe Gly Thr Tyr Leu Pro His Lys Pro Glu Pro 
245 250 255 



Gly Pro Ala Ala Gly Ser Gin Val Met Ala Trp Phe Arg Ala Lys Thr 
260 265 270 



Ser Glu Ala Ser Asp Val Met Ser Phe Leu Thr Cys Tyr His Phe Asp 

275 280 285 

Leu His Trp Glu His His Arg Trp Pro Phe Ala Pro Trp Trp Gin Leu 
290 295 300 



Pro His Cys Arg Arg Leu Ser Gly Arg Gly Leu Val Pro Ala Leu Ala 
305 ' 310 315 320 



<210> 15 

<211> 729 

<212> DNA 

<213> Agrobacterium aurantiacum 



<220> 

<221> CDS 

<222> (1) . . (729) 

<400> 15 

atg age. gca cat gec ctg ccc aag gca gat ctg acc gec acc age ctg 48 

Met Ser Ala His Ala Leu Pro Lys Ala Asp Leu Thr Ala Thr Ser Leu 

15 10 15 



ate gtc teg ggc ggc ate ate gee get tgg ctg gec ctg cat gtg cat 96 
lie Val Ser Gly Gly lie lie Ala Ala Trp Leu Ala Leu His Val His 
20 25 30 
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gcg ctg tgg 
Ala Leu Trp 
35 

aat ttc ctg 
Asn Phe Leu 
50 

cat gac gcg 
His Asp Ala 
65 

gcg gcg atg 
Ala Ala Met 



cgc aag atg 
Arg Lys Met 

gac gac gac 
Asp Asp Asp 
115 

cgc ttc ate 
Arg Phe lie 
130 

gtc ate gtg 
Val He Val 

14 '5 

gtg gtc ttc 
Val Val Phe 



gtg ttc ggc 
Val Phe Gly 



ttt ctg gac 
Phe Leu Asp 

ggg ctg acc 
Gly Leu Thr 



atg cac ggg 
Met His Gly 
70 

ggc cag ctt 
Gly Gin Leu 
85 

ate gtc aag 
He Val Lys 
100 

ccc gat ttc 
Pro Asp Phe 

ggc acc tat 
Gly Thr Tyr 

acg gtc tat 
Thr Val Tyr 
150 

tgg ccg ctg 
Trp Pro Leu 
165 

acc tgg ctg 
Thr Trp Leu 
180 



gca gcg gcg 
Ala Ala Ala 
40 

tgg ctg teg 
Trp Leu Ser 
55 

teg gtg gtg 
Ser Val Val 



gtc ctg tgg 
Val Leu Trp 



cac atg gee 
His Met Ala 
105 

gac cat ggc 
Asp His Gly 
120 

ttc ggc tgg 
Phe Gly Trp 
135 

gcg ctg ate 
Ala Leu He 



ccg teg ate 
Pro Ser He 



ccg cac cgc 
Pro His Arg 
185 



cat ccc ate 
His Pro He 



gtc gga ttg 
Val Gly Leu 
60 

ccg ggg cgt 
Pro Gly Arg 
75 

ctg tat gee 
Leu Tyr Ala 
90 

cat cac cgc 
His His Arg 

ggc ccg gtc 
Gly Pro Val 



cgc gag ggg 
Arg Glu Gly 
140 

ctt ggg gat 
Leu Gly Asp 
155 

ctg gcg teg 
Leu Ala Ser 
170 

ccc ggc cac 
Pro Gly His 



ctg gcg ate 
Leu Ala lie 
45 

ttc ate ate 
Phe He He 



ccg cgc gee 
Pro Arg Ala 

gga ttt teg 
Gly Phe Ser 
95 

cat gee gga 
His Ala Gly 
110 

cgc tgg tac 
Arg Trp Tyr 
125 

ctg ctg ctg 
Leu Leu Leu 

cgc tgg atg 
Arg Trp Met 

ate cag ctg 
He Gin Leu 
175 

gac gcg ttc 
Asp Ala Phe 
190 



gca 144 
Ala 



gcg 192 
Ala 

aat 240 

Asn 

80 

tgg 288 
Trp 

acc 336 
Thr 



gec 384 
Ala 

ccc 432 
Pro 

tac 480 

Tyr 

160 

ttc 528 
Phe 



ccg 576 
Pro 
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gac cgc cac 
Asp Arg His 
195 

ctg acc tgc 
Leu Thr Cys 
210 

ccg acg gtg 
Pro Thr Val 
225 

acc gca tga 
Thr Ala 



aat gcg egg 
Asn Ala Arg 

ttt cac ttt 
Phe His Phe 



ccg tgg tgg 
Pro Trp Trp 
230 



teg teg egg 
Ser Ser Arg 
200 

ggc ggt tat 
Gly Gly Tyr 
215 

cgc ctg ccc 
Arg Leu Pro 



ate age gac 
lie Ser Asp 

cat cac gaa 
His His Glu 
220 

age acc cgc 
Ser Thr Arg 
235 



ccc gtg teg 
Pro Val Ser 
205 

cac cac ctg 
His His Leu 

acc aag ggg 
Thr Lys Gly 



ctg 624 
Leu 

cac 672 
His 

gac 720 

Asp 

240 

729 



<210> 16 

<211> 242 

<212> PRT 

<213> Agrobacterium aurantiacum 

<400> 16 



Met Ser Ala His Ala Leu Pro Lys Ala Asp Leu Thr Ala Thr Ser Leu 
15 10 15 



lie Val Ser Gly Gly lie lie Ala Ala Trp Leu Ala Leu His Val His 
20 25 30 



Ala Leu Trp Phe Leu Asp Ala Ala Ala His Pro lie Leu Ala lie Ala 
35 40 45 



Asn Phe Leu Gly Leu Thr Trp Leu Ser Val Gly Leu Phe lie lie Ala 
50 55 60 
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His Asp Ala Met 
65 



Ala Ala Met Gly 



Arg Lys Met lie 
100 



Asp Asp Asp Pro 
115 



Arg Phe lie Gly 
130 



Val lie Val Thr 
145 



Val Val Phe Trp 



Val Phe Gly Thr 
180 



Asp Arg His Asn 
195 



Leu. Thr Cys Phe 
210 



His Gly Ser Val 
70 



Gin Leu Val Leu 
85 



Val Lys His Met 



Asp Phe Asp His 
120 



Thr Tyr Phe Gly 
135 



Val Tyr Ala Leu 
150 



Pro 'Leu Pro Ser 
165 



Trp Leu Pro His 



Ala Arg ' Ser Ser 
200 



His Phe Gly Gly 
215 



Val Pro Gly Arg 
75 



Trp Leu Tyr Ala 
90 



Ala His His Arg 
105 



Gly Gly Pro Val 



Trp Arg Glu Gly 
140 



lie Leu Gly Asp 
155 



lie Leu Ala Ser 
170 



Arg Pro Gly His 
185 



Arg lie Ser Asp 



Tyr His His Glu 
220 



Pro Arg Ala Asn 
80 



Gly Phe Ser Trp 
95 

His Ala Gly Thr 
110 



Arg Trp Tyr Ala 
125 



Leu Leu Leu Pro 



Arg Trp Met Tyr 
160 



lie Gin Leu Phe 
175 



Asp Ala Phe Pro 
190 



Pro Val Ser Leu 
205 



His" His Leu His 



Pro Thr Val Pro Trp Trp Arg Leu Pro Ser Thr Arg Thr Lys Gly Asp 
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225 230 235 240 



Thr Ala 



<210> 17 

<211> 1631 

<212> DNA 

<213> Alcaligenes sp. 



<220> 

<221> CDS 

<222> (99) . . (827) 

<400> 17 

ctgcaggccg ggcccggtgg ccaatggtcg caaccggcag gactggaaca ggacggcggg 60 

ccggtctagg ctgtcgccct acgcagcagg agtttcgg atg tec gga egg aag cct 116 

Met Ser Gly Arg Lys Pro 
1 5 

ggc aca act ggc gac-acg ate gtc aat etc ggt ctg ace gee gcg ate 164 
Gly Thr Thr Gly Asp Thr He Val Asn Leu Gly Leu Thr Ala Ala He 
10 15 20 

ctg ctg tgc tgg ctg gtc ctg cac gee ttt acg eta tgg ttg eta gat 212 
Leu Leu Cys Trp Leu Val Leu His Ala Phe Thr Leu Trp Leu Leu Asp 
25 30 35 

gcg gee gcg cat ccg ctg ctt gee gtg ctg tgc ctg get ggg ctg ace 260 
Ala Ala Ala His Pro Leu Leu Ala Val Leu Cys Leu Ala Gly Leu Thr 
40 45 50 

tgg ctg teg gtc ggg ctg ttc ate ate gcg cat gac gca atg cac ggg 308 
Trp Leu Ser Val Gly Leu Phe He He Ala His Asp Ala Met His Gly 
55 60 65 70 



tec gtg gtg ccg ggg egg ccg cgc gee aat gcg gcg ate ggg caa ctg 



356 
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Ser Val Val 



gcg ctg tgg 
Ala Leu Trp 



cac atg acg 
His Met Thr 
105 

ggt cac gga 
Gly His Gly 
120 

ttc ggc tgg 
Phe Gly Trp 
135 

gcg ctg ate 
Ala Leu lie 



ccg gec gtt 
Pro Ala Val 



ccc cac cgc 
Pro His Arg 
185 

teg acc ggc 
Ser Thr Gly 
200 

ggc ggc tat 
Gly Gly Tyr 
215 

cgc ctg cct 
Arg Leu Pro 



Pro Gly Arg 
75 

etc tat gcg 
Leu Tyr Ala 
90 

cat cac egg 
His His Arg 

ggg ccc gtg 
Gly Pro Val 



cga gag gga 
Arg Glu Gly 
140 

ctg ggc gat 
Leu Gly Asp 
155 

ctg gcg teg 
Leu Ala "Ser 
170 

ccg gga cat 
Pro Gly His 



ate ggc gac 
lie Gly Asp 



cac cac gaa 
His His Glu 
220 

cgt aca cgc 
Arg Thr Arg 



Pro Arg Ala 

ggg ttc teg 
Gly Phe Ser 
95 

cac gee ggc 
His Ala Gly 
110 

cgc tgg tac 
Arg Trp Tyr 
125 

ctg ctg eta 
Leu Leu Leu 

cgc tgg atg 
Arg Trp Met 

ate cag att 
He Gin lie 
175 

gac gat ttt 
Asp Asp Phe 
190 

ccg ttg tea 
Pro Leu Ser 
205 

cat cac ctg 
His His Leu 



aag acc gga 
Lys Thr Gly 



Asn Ala Ala 
80 

tgg ccc aag 
Trp Pro Lys 

acc gac aac 
Thr Asp Asn 

ggc age ttc 
Gly Ser Phe 
130 

ccg gtg ate 
Pro Val He 
145 

tat gtc ate 
Tyr Val He 
160 

ttc gtc ttc 
Phe Val Phe 



ccc gac egg 
Pro Asp Arg 

eta ctg acc 
Leu Leu Thr 
210 

cat ccg cat 
His Pro His 
225 

ggc cgc gca 
Gly Arg Ala 



He Gly Gin 
85 

ctg ate gee 
Leu He Ala 
100 

gat ccc gat 
Asp Pro Asp 
115 

gtc tec acc 
Val Ser Thr 



gtc acc acc 
Val Thr Thr 



ttc tgg ccg 
Phe Trp Pro 
165 

gga act tgg 
Gly Thr Trp 
180 

cac aac gcg 
His Asn Ala 
195 

tgc ttc cat 
Cys Phe His 

gtg ccg tgg 
Val Pro Trp 



tga 



Leu 

aag 404 
Lys 

ttc 452 
Phe 

tat 500 
Tyr 

tat 548 

Tyr 

150 

gtc 596 
Val 

ctg 644 
Leu 

agg 692 
Arg 

ttc 740 
Phe 

tgg 788 

Trp 

230 

:ct 837 
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235 240 

cattgtcgtg gcgacagtcc tcgtgatgga gctgaccgcc tattccgtcc accgctggat 897 

tatgcacggc cccctaggct ggggctggca caagtcccat cacgaagagc acgaccacgc 957 

gttggagaag aacgacctct acggcgtcgt cttcgcggtg ctggcgacga tcctcttcac 1017 

cgtgggcgcc tattggtggc cggtgctgtg gtggatcgcc ctgggcatga cggtctatgg 1077 

gttgatctat ttcatcctgc acgacgggct tgtgcatcaa cgctggccgt ttcggtatat 1137 

tccgcggcgg ggctatttcc gcaggctcta ccaagctcat cgcctgcacc acgcggtcga 1197 

ggggcgggac cactgcgtca gcttcggctt catctatgcc ccacccgtgg acaagctgaa. 1257 

gcaggatctg aagcggtcgg gtgtcctgcg cccccaggac gagcgtccgt cgtgatctct 1317 

gatcccggcg tggccgcatg aaatccgacg tgctgctggc aggggccggc cttgccaacg 1377 

gactgatcgc gctggcgatc cgcaaggcgc ggcccgacct tcgcgtgctg ctgctggacc 1437 

gtgcggcggg cgcctcggac gggcatactt ggtcctgcca cgacaccgat ttggcgccgc 14 97 

actggctgga ccgcctgaag ccgatcaggc gtggcgactg gcccgatcag gaggtgcggt 1557 

tcccagacca ttcgcgaagg ctccgggccg gatatggctc gatcgacggg cgggggctga 1617 

tgcgtgcggt gacc 1631 



<210> 18 
<211> 242 
<212> PRT 

<213> Alcaligenes sp. 
<400> 18 

Met Ser Gly Arg Lys Pro Gly Thr Thr Gly Asp Thr lie Val Asn Leu 
15 10 15 
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Gly Leu Thr Ala 
20 



Thr Leu Trp Leu 

35 • 



Cys Leu Ala Gly 
50 



His Asp Ala Met 
65 



Ala Ala He Gly 



Pro Lys Leu He 
100 



Asp Asn Asp Pro 
115 



Ser Phe Val Ser 
130 



Val He Val Thr 
145 



Val He Phe Trp 



Ala He Leu Leu 



Leu Asp Ala Ala 
40 



Leu Thr Trp Leu 
55 



His Gly Ser Val 
70 



Gin Leu Ala Leu 

85. 



Ala Lys His Met 



Asp Phe Gly His 
120 



Thr Tyr Phe Gly 
135 



Thr Tyr Ala Leu 
150 



Pro Val Pro Ala 
165 



Cys Trp Leu Val 
25 



Ala His Pro Leu 



Ser Val Gly Leu 
60 



Val Pro Gly Arg 
75 



Trp Leu Tyr Ala 
90 



Thr His His Arg 
105 



Gly Gly Pro Val 



Trp Arg Glu Gly 
140 



He Leu Gly Asp 
155 



Val Leu Ala Ser 
170 



Leu His Ala Phe 
30 



Leu Ala Val Leu 
45 



Phe He He Ala 



Pro Arg Ala Asn 
80 



Gly Phe Ser Trp 
95 



His Ala Gly Thr 
110 



Arg Trp Tyr Gly 
125 



Leu Leu Leu Pro 



Arg Trp Met Tyr 
160 



He Gin He Phe 
175 
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Val Phe Gly Thr Trp Leu Pro His Arg Pro Gly His Asp Asp Phe Pro 



180 



185 



190 



Asp Arg His Asn Ala Arg Ser Thr Gly lie Gly Asp Pro Leu Ser Leu 



195 



200 



205 



Leu Thr Cys Phe His Phe Gly Gly Tyr His His Glu His His Leu His 



210 



215 



220 



Pro His Val Pro Trp Trp Arg Leu Pro Arg Thr Arg Lys Thr Gly Gly 



225 



230 



235 



240 



Arg Ala 



<210> 19 

<211> 729 

<212> DNA 

<213> Paracoccus marcusii 
<220> 

<221> CDS 

<222> (1) . . (729) 

<400> 19 

atg age gca cat gec ctg ccc aag gca gat ctg acc gec aca age ctg 48 
Met Ser Ala His Ala Leu Pro Lys Ala Asp Leu Thr Ala Thr Ser Leu 
15 10 15 

ate gtc teg ggc ggc ate ate gee gca tgg ctg gee ctg cat gtg cat 96 
lie Val Ser Gly Gly lie lie Ala Ala Trp Leu Ala Leu His Val His 



20 



25 



30 



gcg ctg tgg ttt ctg gac gcg gcg gec cat ccc ate ctg gcg gtc gcg 



144 
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Ala Leu Trp 
35 

aat ttc ctg 
Asn Phe Leu 
50 

cat gac gcg 
His Asp Ala 
65 

gcg gcg atg 
Ala Ala Met 



cgc aag atg 
Arg Lys Met 

gac gac gac 
Asp Asp Asp 
115 

cgc ttc ate 
Arg Phe lie 
130 

gtc ate gtg 
Val lie Val 
145 

gtg gtc ttc 
Val Val Phe 



gtg ttc ggc 
Val Phe Gly 

gac cgc cat 
Asp Arg His 



Phe Leu Asp 



ggg ctg acc 
Gly Leu Thr 



atg cac ggg 
Met His Gly 
70 

ggc cag ctt 
Gly Gin Leu 
85 

ate gtc aag 
He Val Lys 
100 

cca gat ttc 
Pro Asp Phe 

ggc acc tat 
Gly Thr -Tyr 

acg gtc tat 
Thr Val Tyr 
150 

tgg ccg ttg 
Trp Pro Leu 
165 

act tgg ctg 
Thr Trp Leu 
180 

aat gcg egg 
Asn Ala Arg 



Ala Ala Ala 
40 

tgg ctg teg 
Trp Leu Sex 
55 

teg gtc gtg 
Ser Val Val 



gtc ctg tgg 
Val Leu Trp 

cac atg gee 
His Met Ala 
105 

gac cat ggc 
Asp His Gly 
120 

ttc ggc tgg 
Phe Gly Trp 
135 

gcg ctg ate 
Ala Leu He 



ccg teg ate 
Pro Ser He 



ccg cac cgc 
Pro His Arg 
185 

teg teg egg 
Ser Ser Arg 



His Pro He 



gtc gga ttg 
Val Gly Leu 
60 

ccg ggg cgt 
Pro Gly Arg 
75 

ctg tat gee 
Leu Tyr Ala 
90 

cat cac cgc 
His His Arg 

ggc ccg gtc 
Gly Pro Val 



cgc gag ggg 
Arg Glu Gly 
140 

ctg ggg gat 
Leu Gly Asp 
155 

ctg gcg teg 
Leu Ala Ser 
170 

ccc ggc cac 
Pro Gly His 

ate age gac 
He Ser Asp 



Leu Ala Val 
45 

ttc ate ate 
Phe He He 



ccg cgc gee 
Pro Arg Ala 

gga ttt teg 
Gly Phe Ser 
95 

cat gee gga 
His Ala Gly 
110 

cgc tgg tac 
Arg Trp Tyr 
125 

ctg ctg ctg 
Leu Leu Leu 

cgc tgg atg 
Arg Trp Met 

ate cag ctg 
He Gin Leu 
175 

gac gcg ttc 
Asp Ala Phe 
190 

cct gtg teg 
Pro Val Ser 



Ala 

gcg 192 
Ala 

aat 240 

Asn 

80 

tgg 288 
Trp 

acc 336 
Thr 



gee 384 
Ala 

ccc 432 
Pro 

tac 480 

Tyr 

160 

ttc 528 
Phe 

ccg 576 
Pro 

ctg 624 
Leu 
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195 200 205 

ctg acc tgc ttt cat ttt ggc ggt tat cat cac gaa cac cac ctg cac 672 
Leu Thr Cys Phe His Phe Gly Gly Tyr His His Glu His His Leu His 
210 215 220 

ccg acg gtg ccg tgg tgg cgc ctg ccc age acc cgc acc aag ggg gac 720 
Pro Thr Val Pro Trp Trp Arg Leu Pro Ser Thr Arg Thr Lys Gly Asp 
225 230 235 240 

acc gca tga 729 
Thr Ala 



<210> 20 
<211> 242 
<212> PRT 

<213> Paracoccus marcusii 
<400> 20 

Met Ser Ala His Ala Leu Pro Lys Ala Asp Leu Thr Ala Thr Ser Leu 
15 10 15 



lie Val Ser Gly Gly lie lie Ala Ala Trp Leu Ala Leu His Val His 
20 25 30 



Ala Leu Trp Phe Leu Asp Ala Ala Ala His Pro lie Leu Ala Val Ala 
35 40 45 



Asn Phe Leu Gly Leu Thr Trp Leu Ser Val Gly Leu Phe lie lie Ala 
50 55 60 



His Asp Ala Met His Gly Ser Val Val Pro Gly Arg Pro Arg Ala Asn 
65 70 75 80 
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Ala Ala Met Gly Gin Leu Val Leu Trp Leu Tyr Ala Gly Phe Ser Trp 
85 90 95 



Arg Lys Met lie Val Lys His Met Ala His His Arg His Ala Gly Thr 
100 105 110 



Asp Asp Asp Pro Asp Phe Asp His Gly Gly Pro Val Arg Trp Tyr Ala 
115 120 125 



Arg Phe lie Gly Thr Tyr Phe Gly Trp Arg Glu Gly Leu Leu Leu Pro 
130 135 140 



Val lie Val Thr Val Tyr Ala Leu lie Leu Gly Asp Arg Trp Met Tyr 
145 150 155 160 



Val Val Phe Trp Pro Leu Pro Ser lie Leu Ala Ser lie Gin Leu Phe 
165 170 175 



Val Phe Gly Thr Trp Leu Pro His Arg Pro Gly His Asp Ala Phe Pro 
180 185 190 



Asp Arg His Asn Ala Arg Ser Ser Arg lie Ser Asp Pro Val Ser Leu 
195 200 205 



Leu Thr Cys Phe His Phe Gly Gly Tyr His His Glu His His Leu His 
210 215 220 



Pro Thr Val Pro Trp Trp Arg Leu Pro Ser Thr Arg Thr Lys Gly Asp 
225 230 235 240 
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Thr Ala 



<210> 21 

<211> 1629 

<212> DNA 

<213> Synechocystis sp. 
<220> 

<221> CDS 

<222> (1) . . (1629) 

<400> 21 

atg ate acc acc gat gtt gtc att att ggg gcg ggg cac aat ggc tta 48 

Met He Thr Thr Asp Val Val He He Gly Ala Gly His Asn Gly Leu 
15 10 15 

gtc tgt gca' gec tat ttg etc caa egg ggc ttg ggg gtg acg tta eta 96 
Val Cys Ala Ala Tyr Leu Leu Gin Arg Gly Leu Gly Val Thr Leu Leu 
20 25 30 

gaa aag egg gaa gta xca ggg ggg gcg gee acc aca gaa get etc atg 144 
Glu Lys Arg Glu Val Pro Gly Gly Ala Ala Thr Thr Glu Ala Leu Met 
35 40 45 

ccg gag eta tec ccc cag ttt cgc ttt aac cgc tgt gec att gac cac 192 
Pro Glu Leu Ser Pro Gin Phe Arg Phe Asn Arg Cys Ala He Asp His 
50 55 60 

gaa ttt ate ttt ctg ggg ccg gtg ttg cag gag eta aat tta gec cag 240 
Glu Phe He- Phe Leu Gly Pro Val Leu Gin Glu Leu Asn Leu Ala Gin 
65 70 75 80 



tat ggt ttg gaa tat tta ttt tgt gac ccc agt gtt ttt tgt ccg ggg 288 
Tyr Gly Leu Glu Tyr Leu Phe Cys Asp Pro Ser Val Phe Cys Pro Gly 
85 90 95 
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ctg gat ggc 
Leu Asp Gly 



gcc cac att 
Ala His He 
115 

ttt gtc aat 
Phe Val Asn 
130 

aat get ccg 
Asn Ala Pro 
145 

gaa aac tta 
Glu Asn Leu 

ttg gat ttt 
Leu Asp Phe 

gaa tgg ttc 
Glu Trp Phe 
195 



caa get ttt 
Gin Ala Phe 
100 

gcc acc tat 
Ala Thr Tyr 

tat tgg acg 
Tyr Trp Thr 

ccc cag get 
Pro Gin Ala 
150 

aaa tec gtg 
Lys Ser Val 
165 

ate cgc act 
He Arg Thr 
180 

gac age -gaa 
Asp Ser Glu 



atg age tac 
Met Ser Tyr 
105 

age ccc cga 
Ser Pro Arg 
120 

gat ttg etc 
Asp Leu Leu 
135 

tta eta gat 
Leu Leu Asp 

ctg gcg ate 
Leu Ala He 



atg ate ggc 
Met He Gly 
185 

egg gtt aaa 
Arg Val Lys 
200 



cgt tec eta 
Arg Ser Leu 

gat gcg gaa 
Asp Ala Glu 

aac get gtc 
Asn Ala Val 
140 

tta gcc ctg 
Leu Ala Leu 
155 

gcc ggg teg 
Ala Gly Ser 
170 

tec ccg gaa 
Ser Pro Glu 

get cct tta 
Ala Pro Leu 



gaa aaa acc 
Glu Lys Thr 
110 

aaa tat egg 
Lys Tyr Arg 
125 

cag cct get 
Gin Pro Ala 



aac tat ggt 
Asn Tyr Gly 

aaa acc aag 
Lys Thr Lys 
175 

gat gtg etc 
Asp Val Leu 
190 

get aga eta 
Ala Arg Leu 
205 



tgt 336 
Cys 

caa 384 
Gin 

ttt 432 
Phe 

tgg 480 

Trp 

160 

gcg 528 
Ala 

aat 576 
Asn 

tgt 624 

Cys 



teg gaa att ggc get ccc cca tec caa 
Ser Glu He Gly Ala Pro Pro Ser Gin 
210 215 

atg atg gtg gcc atg egg cat ttg gag 
Met Met Val Ala Met Arg His Leu Glu 
225 230 

ggc act gga gcc etc aca gaa gcc ttg 
Gly Thr Gly Ala Leu Thr Glu Ala Leu 
245 



aag ggt agt age tec ggc atg 672 
Lys Gly Ser Ser Ser Gly Met 
220 

gga att gcc aga cca aaa gga 720 
Gly He Ala Arg Pro Lys Gly 
235 240 

gtg aag tta gtg caa gcc caa 768 
Val Lys Leu Val Gin Ala Gin 
250 255 
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ggg gga aaa 
Gly Gly Lys 

aac aac cag 
Asn Asn Gin 
275 

gcc aaa aaa 
Ala Lys Lys 
290 

caa ttg gtg 
Gin Leu Val 
305 

gaa cga ctg 
Glu Arg Leu 



ate gat tgt 
lie Asp Cys 

ccg gag gat 
Pro Glu Asp 
355 

gtc gag gaa 
Val Glu Glu 
370 

aat ccg tct 
Asn Pro Ser 
385 

gcc ccc cct 
Ala Pro Pro 



ate etc act 
lie Leu Thr 
260 

gcg ate ggg 
Ala He Gly 



ggc gtg att 
Gly Val He 



gaa ccg ggg 
Glu Pro Gly 
310 

gaa egg cgc 
Glu Arg Arg 
325 



gcc etc tec 
Ala Leu Ser 
340 

eta acg gga 
Leu Thr Gly 



gcc cac gcc 
Ala His Ala 

tta tat ttg 
Leu Tyr Leu 
390 

ggg cag cac 
Gly Gin His 
405 



gac caa ace 
Asp Gin Thr 
265 

gtg gag gta 
Val Glu Val 
280 

tct aac ate 
Ser Asn He 
295 

gcc eta gcc 
Ala Leu Ala 



act gtg aac 
Thr Val Asn 



ggt tta ccc 
Gly Leu Pro 
345 

act att ttg 
Thr He Leu 
360 

etc att gcc 
Leu lie Ala 
375 

gat att ccc 
Asp He Pro 

ace etc tgg 
Thr Leu Trp 



gtc aaa egg 
Val Lys Arg 

get aac gga 
Ala Asn Gly 

gat gcc cgc 
Asp Ala Arg 
300 

aag gtg aat 
Lys Val Asn 
315 

aat aac gaa 
Asn Asn Glu 
330 

cac ttc act 
His Phe Thr 



att gcc gac 
lie Ala Asp 

ttg ggg caa 
Leu Gly Gin 
380 

act gta ttg 
Thr Val Leu 
395 

ate gaa ttt 
He Glu Phe 
410 



gta ttg gtg 
Val Leu Val 
270 

gaa cag tac 
Glu Gin Tyr 
285 

cgt tta ttt 
Arg Leu Phe 

caa aac eta 
Gin Asn Leu 

gcc att tta 
Ala lie Leu 
335 



gcc atg gcc 
Ala Met Ala 
350 

teg gta cgc 
Ser Val Arg 
365 

att ccc gat 
lie Pro Asp 

gac ccc ace 
Asp Pro Thr 

ttt gcc ccc 
Phe Ala Pro 
415 



gaa 816 
Glu 

egg 864 
Arg 

ttg 912 
Leu 



ggg 960 

Gly 

320 

aaa 1008 
Lys 



ggg 1056 
Gly 

cat 1104 
His 



get 1152 
Ala 

atg 1200 

Met 

400 

tac 1248 
Tyr 
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cgc ate gee ggg ttg gaa 
Arg lie Ala Gly Leu Glu 
420 

gat gag tta aag gaa aaa 
Asp Glu Leu Lys Glu Lys 
435 

gac tat gec cct aac eta 
Asp Tyr Ala Pro Asn Leu 
450 
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ggg aca ggg tta atg ggc 
Gly Thr Gly Leu Met Gly 
425 

gtg gcg gat egg gtg att 
Val Ala Asp Arg Val He 
440 

aaa tct ctg ate att ggt 
Lys Ser Leu He He Gly 
455 460 
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aca ggt tgg ace 1296 
Thr Gly Trp Thr 
430 

gat aaa tta acg 1344 

Asp Lys Leu Thr 

445 

cgc cga gtg gaa 1392 
Arg Arg Val Glu 



agt ccc gee gaa ctg gec caa egg ctg gga agt tac aac ggc aat gtc 1440 

Ser Pro Ala Glu Leu Ala Gin Arg Leu Gly Ser Tyr Asn Gly Asn Val 
465 470 475 480 

tat cat ctg gat atg agt ttg gac caa atg atg ttc etc egg cct eta 1488 

Tyr His Leu Asp Met Ser Leu Asp Gin Met Met Phe Leu Arg Pro Leu 

485 490 495 

ccg gaa att gee aac tac caa acc ccc ate aaa aat ctt tac tta aca 1536 

Pro Glu He Ala Asn Tyr Gin Thr Pro He Lys Asn Leu Tyr Leu Thr 

500 505 510 

ggg gcg ggt acc cat ccc ggt ggc tec ata tea ggt atg ccc ggt aga 1584 

Gly Ala Gly Thr His Pro Gly Gly Ser He Ser Gly Met Pro Gly Arg 
515 520 525 

aat tgc get egg gtc ttt tta aaa caa caa cgt cgt ttt tgg taa 1629 
Asn Cys Ala Arg Val Phe Leu Lys Gin Gin Arg Arg Phe . Trp 
530 535 540 



<210> 22 

<211> 542 

<212> PRT 

<213> Synechocystis sp. 



<400> 22 
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Met He Thr Thr Asp Val Val He He Gly Ala Gly His Asn Gly Leu 
1 5 10 15 



Val Cys Ala Ala Tyr Leu Leu Gin Arg Gly Leu Gly Val Thr Leu Leu 
20 25 30 



Glu Lys Arg Glu Val Pro Gly Gly Ala Ala Thr Thr Glu Ala Leu Met 
35 40 45 



Pro Glu Leu Ser Pro Gin Phe Arg Phe Asn Arg Cys Ala He Asp His 
50 55 60 



Glu Phe He Phe Leu Gly Pro Val Leu Gin Glu Leu .Asn Leu Ala Gin 
65 70 75 80 



Tyr Gly Leu Glu Tyr Leu Phe Cys Asp Pro Ser Val Phe Cys Pro Gly 
85 90 95 



Leu Asp Gly Gin Ala Phe Met Ser Tyr Arg Ser Leu Glu Lys Thr Cys 
100 105 110 



Ala His lie Ala Thr Tyr Ser Pro Arg Asp Ala Glu Lys Tyr Arg Gin 
115 120 125 



Phe Val Asn Tyr Trp Thr Asp Leu Leu Asn Ala Val Gin Pro Ala Phe 
130 135 140 



Asn Ala Pro Pro Gin Ala Leu Leu Asp Leu Ala Leu Asn Tyr Gly Trp 
145 150 155 160 
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Glu Asn Leu Lys Ser Val Leu Ala lie Ala Gly Ser Lys Thr Lys Ala 
165 170 175 



Leu Asp Phe lie Arg Thr Met lie Gly Ser Pro Glu Asp Val Leu Asn 
180 185 190 



Glu Trp Phe Asp Ser Glu Arg Val Lys Ala Pro Leu Ala Arg Leu Cys 
195 200 205 



Ser Glu lie Gly Ala Pro Pro Ser Gin Lys Gly Ser Ser Ser Gly Met 
210 215 220 



Met Met Val Ala Met Arg His Leu Glu Gly lie Ala Arg Pro Lys Gly 
225 230 235 240 



Gly Thr Gly Ala Leu Thr Glu Ala Leu Val Lys Leu Val Gin Ala Gin 
245 250 255 



Gly Gly Lys lie Leu "Thr Asp Gin Thr Val Lys Arg Val Leu Val Glu 
260 265 270 



Asn Asn Gin Ala lie Gly Val Glu Val Ala Asn Gly Glu Gin Tyr Arg 
275 280 285 



Ala Lys Lys Gly Val lie Ser Asn lie Asp Ala Arg Arg Leu Phe Leu 
290 295 300 



Gin Leu Val Glu Pro Gly Ala Leu Ala Lys Val Asn Gin Asn Leu Gly 
305 310 315 320 



Glu Arg Leu Glu Arg Arg Thr Val Asn Asn Asn Glu Ala lie Leu Lys 
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lie Asp Cys Ala 
340 



Pro Glu Asp Leu 
355 



Val Glu Glu Ala 
370 



Asn Pro Ser Leu 
385 



Ala Pro Pro Gly 



Arg lie Ala Gly 
420 



Asp Glu Leu Lys 
435 



Asp Tyr Ala Pro 
450 



Ser Pro Ala Glu 
465 



325 



Leu Ser Gly Leu 



Thr Gly Thr lie 
360 



His Ala Leu lie 
375 



Tyr Leu Asp lie 
390 



Gin His Thr Leu 
405 



Leu Glu Gly Thr 



Glu Lys Val Ala 
440 



Asn Leu Lys Ser 
455 



Leu Ala Gin Arg 
470 



330 



Pro His Phe Thr 
345 



Leu lie Ala Asp 



Ala Leu Gly Gin 
380 



Pro Thr Val Leu 
395 



Trp lie Glu Phe 
410 



Gly Leu Met Gly 
425 



Asp Arg Val lie 



Leu lie lie Gly 
460 



Leu Gly Ser Tyr 
475 



335 



Ala Met Ala Gly 
350 



Ser Val Arg His 
365 



lie Pro Asp Ala 



Asp Pro Thr Met 
400 



Phe Ala Pro Tyr 
415 



Thr Gly Trp Thr 
430 



Asp Lys Leu Thr 
445 



Arg Arg Val Glu 



Asn Gly Asn Val 
480 



Tyr His Leu Asp Met Ser Leu Asp Gin Met Met Phe Leu Arg Pro Leu 
485 490 495 
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Pro Glu lie Ala Asn Tyr Gin Thr Pro lie Lys Asn Leu Tyr Leu Thr 
500 505 510 



Gly Ala Gly Thr His Pro Gly Gly Ser He Ser Gly Met Pro Gly Arg 
515- 520 525 



Asn Cys Ala Arg Val Phe Leu Lys Gin Gin Arg Arg Phe Trp 
530 535 540 



<210> 23 

<211> 776 

<212> DNA 

<213> Bradyrhizobium sp. 



<220> 

<221> CDS 

<222> (1) . . (774) 

<400> 23 

atg cat gca gca acc gcc aag get act gag ttc ggg gec tct egg cgc 48 
Met His Ala Ala Thr Ala Lys Ala Thr Glu Phe Gly Ala Ser Arg Arg 
15 10 15 

gac gat gcg agg cag cgc cgc gtc ggt etc acg ctg gcc gcg gtc ate 96 
Asp Asp Ala Arg Gin Arg Arg Val Gly Leu Thr Leu Ala Ala Val He 
20 25 30 

ate gcc gcc tgg ctg gtg ctg cat gtc ggt ctg atg ttc ttc tgg ccg 144 
He Ala Ala Trp Leu Val Leu His Val Gly Leu Met Phe Phe Trp Pro 
35 40 45 

ctg acc ctt cac age ctg ctg ccg get ttg cct ctg gtg gtg ctg cag 192 
Leu Thr Leu His Ser Leu Leu Pro Ala Leu Pro Leu Val Val Leu Gin 
50 55 60 
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acc tgg etc 
Thr Trp Leu 
65 

ggc teg ctg 
Gly Ser Leu 

etc tgc ctg 
Leu Cys Leu 

gag cac cac 
Glu His His 
115 

ttc gac gag 
Phe Asp Glu 
130 

ttc ctg cac 
Phe Leu His 
145 

teg ctg gtt 
Ser Leu Val 



ctg ttc tgg 
Leu Phe Trp 

ttc ggc acc 
Phe Gly Thr 
195 

cgc cac aac 
Arg His Asn 
210 



tat gta ggc 
Tyr Val Gly 
70 

gtg ccg ttc 
Val Pro Phe 
85 

ttc etc tat 
Phe Leu Tyr 
100 

aag cat cac 
Lys His His 

gtg ccg ccg 
Val Pro Pro 



tat ttc ggc 
Tyr Phe Gly 
150 

tat cag -etc 
Tyr Gin Leu 
165 

gcg ctg ccc 
Ala Leu Pro 
180 

tat ctg ccg 
Tyr Leu Pro 

gcg egg acg 
Ala Arg Thr 



ctg ttc ate 
Leu Phe lie 



aag ccg cag 
Lys Pro Gin 

gec ggg ttc 
Ala Gly Phe 
105 

cgc cat ccc 
Arg His Pro 
120 

cac ggc ttc 
His Gly Phe 
135 

tgg aag cag 
Trp Lys Gin 

gtc ttc gec 
Val Phe Ala 



ggg ctg ctg 
Gly Leu Leu 
185 

cac aag ccg 
His Lys Pro 
200 

age gaa ttt 
Ser Glu Phe 
215 



ate gcg cat 
He Ala His 
75 

gtc aac cgc 
Val Asn Arg 
90 

tec ttc gac 
Ser Phe Asp 



ggc acg gec 
Gly Thr Ala 



tgg cac tgg 
Trp His Trp 
140 

gtc gcg ate 
Val Ala He 
155 

gtt ccc ttg 
Val Pro Leu 
170 

teg gcg ctg 
Ser Ala Leu 



gee acg cag 
Ala Thr Gin 



ccc gcg tgg 
Pro Ala Trp 
220 



gac tgc atg 
Asp Cys Met 

cgt ate gga 
Arg He Gly 
95 

get etc aat 
Ala Leu Asn 
110 

gag gat ccc 
Glu Asp Pro 
125 

ttc gee age 
Phe Ala Ser 



ate gca gec 
He Ala Ala 



cag aac ate 
Gin Asn He 
175 

cag ctg ttc 
Gin Leu Phe 
190 

ccc ttc gee 
Pro Phe Ala 
205 

ctg teg ctg 
Leu Ser Leu 



cac 240 

His 

80 

cag 288 
Gin 

gtc 336 
Val 

gat 384 
Asp 

ttt 432 
Phe 



gtc 480 

Val 

160 

ctg 528 
Leu 

acc 576 
Thr 

gat 624 
Asp 

ctg 672 
Leu 



acc tgc ttc cac ttc ggc ttt cat cac gag cat cat ctg cat ccc gat 



720 
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Thr Cys Phe His Phe Gly Phe His His Glu His His Leu His Pro Asp 

225 230 235 240 

gcg ccg tgg tgg egg ctg ccg gag ate aag egg egg gee ctg gaa agg 768 
Ala Pro Trp Trp Arg Leu Pro Glu lie Lys Arg Arg Ala Leu Glu Arg 
245 250 255 

cgt gac ta 776 
Arg Asp 



<210> 24 
<211> 258 
<212> PRT 

<213> Bradyrhizobium sp. 
<400> 24 

Met His Ala Ala Thr Ala Lys Ala Thr Glu Phe Gly Ala Ser Arg Arg 
15 10 15 



Asp Asp Ala Arg Gin Arg Arg Val Gly Leu Thr Leu Ala Ala Val lie 
20 -• 25 30 



lie Ala Ala Trp Leu Val Leu His Val Gly Leu Met Phe Phe Trp Pro 
35 40 45 



Leu Thr Leu His Ser Leu Leu Pro Ala Leu Pro Leu Val Val Leu Gin 
50 55 60 



Thr Trp Leu Tyr Val Gly Leu Phe lie lie Ala His Asp Cys Met His 
65 70 75 80 



Gly Ser Leu Val Pro Phe Lys Pro Gin Val Asn Arg Arg lie Gly Gin 
85 90 95 
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Leu Cys Leu Phe 
100 



Glu His His Lys 
115 



Phe Asp Glu Val 
130 



Phe Leu His Tyr 
145 



Ser Leu Val Tyr 



Leu Phe Trp Ala 
180 



Phe Gly Thr Tyr 
195 



Arg His Asn Ala 
210 



Thr Cys Phe His 
225 



Ala Pro Trp Trp 



Leu Tyr Ala Gly 



His His Arg His 
120 



Pro Pro His Gly 
135 



Phe Gly Trp Lys 
150 



Gin Leu Val Phe 
165 



Leu Pro Gly Leu 



Leu Pro His Lys 
200 



Arg Thr Ser Glu 
215 



Phe Gly Phe His 
230 



Arg Leu Pro Glu 
245 



Phe Ser Phe Asp 
105 



Pro Gly Thr Ala 



Phe Trp His Trp 
140 



Gin Val Ala lie 
155 



Ala Val Pro Leu 
170 



Leu Ser Ala Leu 
185 



Pro Ala Thr Gin 



Phe Pro Ala Trp 
220 



His Glu His His 
235 



lie Lys Arg Arg 
250 



Ala Leu Asn Val 
110 



Glu Asp Pro Asp 
125 



Phe Ala Ser Phe 



lie Ala Ala Val 
160 



Gin Asn lie Leu 
175 



Gin Leu Phe Thr 
190 



Pro Phe Ala Asp 
205 



Leu Ser Leu Leu 



Leu His Pro Asp 
240 



Ala Leu Glu Arg 
255 
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Arg Asp 



<210> 25 

<211> 777 

<212> DNA 

<213> Nostoc sp. 



<220> 

<221> CDS 

<222> (1) . . (777) 

<400> 25 

atg gtt cag tgt caa cca tea tct ctg cat tea gaa aaa ctg gtg tta 48 
Met Val Gin Cys Gin Pro Ser Ser Leu His Ser Glu Lys Leu Val Leu 
15 10 15 

ttg tea teg aca ate aga gat gat aaa aat att aat aag ggt ata ttt 96 
Leu Ser Ser Thr lie Arg Asp Asp Lys Asn lie Asn Lys Gly lie Phe 
20 25 30 

att gee tgc ttt ate -tta ttt tta tgg gca att agt tta ate tta tta 144 
lie Ala Cys Phe lie Leu Phe Leu Trp Ala lie Ser Leu lie Leu Leu 
35 40 45 

etc tea ata gat aca tec ata att cat aag age tta tta ggt ata gee 192 
Leu Ser lie Asp Thr Ser lie lie His Lys Ser Leu Leu Gly lie Ala 
50 55 60 

atg ctt tgg cag ace ttc tta tat aca ggt tta ttt att act get cat 240 
Met Leu Trp Gin Thr Phe Leu Tyr Thr Gly Leu Phe lie Thr . Ala His 
65 70 75 80 

gat gee atg cac ggc gta gtt tat ccc aaa aat ccc aga ata aat aat 288 
Asp Ala Met His Gly Val Val Tyr Pro Lys Asn Pro Arg lie Asn Asn 
85 90 95 



ttt ata ggt aag etc act eta ate ttg tat gga eta etc cct tat aaa 



336 
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Phe lie Gly 

gat tta ttg 
Asp Leu Leu 
115 

tta gac cct 
Leu Asp Pro 
130 

tat eta cat 
Tyr Leu His 
145 

tta gtg atg 
Leu Val Met 

aat aat tta 
Asn Asn Leu- 

caa eta ttt 
Gin Leu Phe 
195 

ggt tat act 
Gly Tyr Thr 
210 

tgg tct ttt 
Trp Ser Phe 
225 

gaa tac cct 
Glu Tyr Pro 



Lys Leu Thr 
100 

aaa aaa cat 
Lys Lys His 

gat tat tac 
Asp Tyr Tyr 

ttt atg aag 
Phe Met Lys 
150 

att ttt cat 
He Phe His 
165 

att ata ttt 
He He Phe 
180 

tat ttt ggt 
Tyr Phe -Gly 

aac ccc cat 
Asn Pro His 

gtt act tgt 
Val Thr Cys 
230 

caa ctt cct 
Gin Leu Pro 
245 



Leu He Leu 
105 

tgg tta cac 
Trp Leu His 
120 

aat ggt cat 
Asn Gly His 
135 

tct tat tgg 
Ser Tyr Trp 

gga ctt aaa 
Gly Leu Lys 

tgg atg ata 
Trp Met He 
185 

aca ttt ttg 
Thr Phe Leu 
200 

tgt gcg cgc 
Cys Ala Arg 
215 

tat cac ttc 
Tyr His Phe 

tgg tgg aaa 
Trp Trp Lys 



Tyr Gly Leu 

cac gga cat 
His Gly His 

ccc caa aac 
Pro Gin Asn 
140 

cga tgg acg 
Arg Trp Thr 
155 

aat ctg gtg 
Asn Leu Val 
170 

cct tct att 
Pro Ser lie 

cct cat aaa 
Pro His Lys 

agt ate cca 
Ser He Pro 
220 

ggc tac cac 
Gly Tyr His 
235 

tta cct gaa 
Leu Pro Glu 
250 



Leu Pro Tyr 
110 

cct ggt act 
Pro Gly Thr 
125 

ttc ttt ctt 
Phe Phe Leu 



caa att ttc 
Gin He Phe 

cat ata cca 
His He Pro 
175 

tta agt tea 
Leu Ser Ser 
190 

aag eta gaa 
Lys Leu Glu 
205 

tta cct ctt 
Leu Pro Leu 

aag gaa cat 
Lys Glu His 

get cac aaa 
Ala His Lys 
255 



Lys 

gat 384 
Asp 

tgg 432 
Trp 

gga 480 

Gly 

160 

gaa 528 
Glu 



gta 576 
Val 



ggt 624 
Gly 



ttt 672 
Phe 



cac 720 

His 

240 

ata 768 
He 



tct tta taa 
Ser Leu 



777 
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<210> 26 

<211> 258 

<212> PRT 

<213> Nostoc sp. 

<400> 26 

Met Val Gin Cys Gin Pro Ser Ser Leu His Ser Glu Lys Leu Val Leu 
15 10 15 



Leu Ser Ser Thr lie Arg Asp Asp Lys Asn lie Asn Lys Gly lie Phe 
20 25 30 



lie Ala Cys Phe lie Leu Phe Leu Trp Ala lie Ser Leu lie Leu Leu 
35 -40 45 



Leu Ser lie Asp Thr Ser lie lie His Lys Ser Leu Leu Gly lie Ala 
50 55 60 



Met. Leu Trp Gin Thr Phe Leu Tyr Thr Gly Leu Phe lie Thr Ala His 
65 70 75 80 



Asp Ala Met His Gly Val Val Tyr Pro Lys Asn Pro Arg lie Asn Asn 
85 90 95 



Phe lie Gly Lys Leu Thr Leu lie Leu Tyr Gly Leu Leu Pro Tyr Lys 
100 105 110 



Asp Leu Leu Lys Lys His Trp Leu His His Gly His Pro Gly Thr Asp 
115 120 125 
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Leu Asp Pro Asp 
130 

Tyr Leu His Phe 
145 



Leu Val Met lie 



Asn Asn Leu lie 
180 



Gin Leu Phe Tyr 
195 



Gly Tyr Thr Asn 
210 



Trp Ser Phe Val 

225 



Glu Tyr Pro Gin 



Ser Leu 



Tyr Tyr Asn Gly 
135 



Met Lys Ser Tyr 
150 



Phe His Gly Leu 
165 



lie Phe Trp Met 



Phe Gly Thr Phe 
200 



Pro His Cys Ala 
215 



Thr Cys Tyr His 
230 



Leu Pro Trp Trp 
245 



His Pro Gin Asn 
140 



Trp Arg Trp Thr 
155 



Lys Asn Leu Val 
170 



lie Pro Ser lie 
185 



Leu Pro His Lys 



Arg Ser lie Pro 
220 



Phe Gly Tyr His 
235 



Lys Leu Pro Glu 
250 



Phe Phe Leu Trp 



Gin He Phe Gly 
160 



His He Pro Glu 
175 



Leu Ser Ser Val 
190 



Lys Leu Glu Gly 
205 



Leu Pro Leu Phe 



Lys Glu His His 
240 



Ala His Lys He 
255 



<210> 27 
<211> 789 
<212> DNA 
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<213> Nostoc punctiforme 



<220> 

<221> CDS 

<222> (1) . . (789) 

<400> 27 

ttg aat ttt tgt gat aaa cca gtt age tat tat gtt gca ata gag caa 48 

Leu Asn Phe Cys Asp Lys Pro Val Ser Tyr Tyr Val Ala lie Glu Gin 
15 10 15 



tta agt get aaa gaa gat 
Leu Ser Ala Lys Glu Asp 
20 

att att agt ctt tgg gta 
lie lie Ser Leu Trp Val 
35 

tat gee aaa gtc cca att 
Tyr Ala Lys Val Pro lie 
50 



act gtt tgg ggg ctg gtg 
Thr Val Trp Gly Leu Val 
25 

get agt ttg get ttt tta 
Ala Ser Leu Ala Phe Leu 
40 

tgg ttg ata cct att gca 
Trp Leu lie Pro lie Ala 
55 60 



att gtc ata gta 96 
He Val He Val 
30 

eta get att aat 144 

Leu Ala He Asn 

45 

ata gtt tgg caa 192 
He Val Trp Gin 



atg ttc ctt 
Met Phe Leu 
65 

ggg tea gtt 
Gly Ser Val 

eta get gta 
Leu Ala Val 



tat aca "ggg 
Tyr Thr Gly 
70 

tat cgt aaa 
Tyr Arg Lys 
85 

gcg ctt tac 
Ala Leu Tyr 
100 



eta ttt att 
Leu Phe He 

aat ccc aaa 
Asn Pro Lys 

get gtg ttt 
Ala Val Phe 
105 



act gca cat 
Thr Ala His 
75 

att aat aat 
He Asn Asn 
90 

cca tat caa 
Pro Tyr Gin 



gat get atg 
Asp Ala Met 

ttt ate ggt 
Phe He Gly 
95 

cag atg tta 
Gin Met Leu 
110 



cat 240 

His 

80 

tea 288 
Ser 

aag 336 
Lys 



aat cat tgc tta cat cat cgt cat cct get. age gaa gtt gac cca gat 384 
Asn His Cys Leu His His Arg His Pro Ala Ser Glu Val Asp Pro Asp 
115 120 125 



ttt cat gat ggt aag aga aca aac get att ttc tgg tat .etc cat ttc 



432 
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Phe His Asp 
130 

atg ata gaa 
Met lie Glu 
145 

ttt aat tta 
Phe Asn Leu 

tta ttt tgg 
Leu Phe Trp 



ttc gga aca 
Phe Gly Thr 
195 

ccc cat tgc 
Pro His Cys 
210 

get tgc tac 
Ala Cys Tyr 
225 

gta cct tgg 
Val Pro Trp 

aat tea gta 
Asn Ser Val 



Gly Lys Arg 

tac tec agt 
Tyr Ser Ser 
150 

get aaa tac 
Ala Lys Tyr 
165 

agt att cct 
Ser lie Pro 
180 

ttt ttg cct 
Phe Leu Pro 



age caa aca 
Ser Gin Thr 



cac ttt ggt 
His Phe -Gly 
230 

tgg caa ctt 
Trp Gin Leu 
245 

acc aat teg 
Thr Asn Ser 
260 



Thr Asn Ala 
135 

tgg caa cag 
Trp Gin Gin 

gtt ttg cac 
Val Leu His 

cca att tta 
Pro lie Leu 
185 

cat cga gaa 
His Arg Glu 
200 

ata aaa ttg 
lie Lys Leu 
215 

tat cat gaa 
Tyr His Glu 

cca tct gta 
Pro Ser Val 



taa 



lie Phe Trp 
140 

tta ata gta 
Leu lie Val 
155 

ate cat caa 
He His Gin 
170 

agt tec att 
Ser Ser He 



ccc aag aaa 
Pro Lys Lys 

cca act ttt 
Pro Thr Phe 
220 

gaa cat cat 
Glu His His 
235 

tat aag cag 
Tyr Lys Gin 
250 



Tyr Leu His 

eta act ate 
Leu Thr He 

ata aat etc 
He Asn Leu 
175 

caa ctg ttt 
Gin Leu Phe 
190 

gga tat gtt 
Gly Tyr Val 
205 

ttg tea ttt 
Leu Ser Phe 

gag tat ccc 
Glu Tyr Pro 

aga gta ttc 
Arg Val Phe 
255 



Phe 

eta 480 

Leu 

160 

ate 528 
He 

tat 576 
Tyr 

tat 624 
Tyr 

ate 672 
He 



cat 720 

His 

240 

aac 768 
Asn 

789 



<210> 28 

<211> 262 

<212> PRT 

<213> Nostoc 



punctif orme 
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<400> 28 

Leu Asn Phe Cys 
1 



Leu Ser Ala Lys 
20 



lie lie Ser Leu 
35 



Tyr Ala Lys Val 
50 



Met Phe Leu Tyr 
65 



Gly Ser Val Tyr 



Leu Ala Val Ala 
100 



Asn His Cys Leu 
115 



Phe His Asp Gly 
130 



Met lie Glu Tyr 
145 



Asp Lys Pro Val 
5 



Glu Asp Thr Val 



Trp Val Ala Ser 
40 



Pro lie Trp Leu 
55 



Thr Gly Leu Phe 
70 



Arg Lys Asn Pro 
85 



Leu Tyr Ala Val 



His His Arg His 
120 



Lys Arg Thr Asn 
135 



Ser Ser Trp Gin 
150 



Ser Tyr Tyr Val 
10 



Trp Gly Leu Val 
25 



Leu Ala Phe Leu 



He Pro He Ala 
60 



He Thr Ala His 
75 



Lys He Asn Asn 
90 



Phe Pro Tyr Gin 
105 



Pro Ala Ser Glu 



Ala He Phe Trp 
140 



Gin Leu He Val 
155 



Ala He Glu Gin 
15 



He Val He Val 
30 



Leu Ala lie Asn 
45 



He Val Trp Gin 



Asp Ala Met His 
80 



Phe He Gly Ser 
95 



Gin Met Leu Lys 
110 



Val Asp Pro Asp 
125 



Tyr Leu His Phe 



Leu Thr He Leu 
160 
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Phe Asn Leu Ala Lys Tyr Val Leu His lie His Gin lie Asn Leu lie 
165 170 175 



Leu Phe Trp Ser lie Pro Pro lie Leu Ser Ser lie Gin Leu Phe Tyr 
180 185 190 



Phe Gly Thr Phe Leu Pro His Arg Glu Pro Lys Lys Gly Tyr Val Tyr 
195 200 205 



Pro His Cys Ser Gin Thr lie Lys Leu Pro Thr Phe Leu Ser Phe lie 
210 215 220 



Ala Cys Tyr His Phe Gly Tyr His Glu Glu His His Glu Tyr Pro His 
225 230 235 240 



Val Pro Trp Trp Gin Leu Pro Ser Val Tyr Lys Gin Arg Val Phe Asn 
245 250 255 



Asn Ser Val Thr Asn Ser 
260 



<210> 29 

<211> 762 

<212> DNA 

<213> Nostoc puncti.f orme 
<220> 

<221> CDS 

<222> (1) . . (762) 

<400> 29 

gtg ate cag tta gaa caa cca etc agt cat caa gca aaa ctg act cca 48 



BASF AG 

BASF NAE 877/03 



71/365 



January 08, 2004 



Val lie Gin Leu Glu Gin Pro Leu Ser His Gin Ala Lys Leu Thr Pro 
15 10 15 

gta ctg aga agt aaa tct cag ttt aag ggg ctt ttc att get att gtc 96 
Val Leu Arg Ser Lys Ser Gin Phe Lys Gly Leu Phe lie Ala lie Val 
20 25 30 

att gtt age gca tgg gtc att age ctg agt tta tta ctt tec ctt gac 144 
lie Val Ser Ala Trp Val lie Ser Leu Ser Leu Leu Leu Ser Leu Asp 
35 40 45 

ate tea aag eta aaa ttt tgg atg tta ttg cct gtt ata eta tgg caa 192 
lie Ser Lys Leu Lys Phe Trp Met Leu Leu Pro Val lie Leu Trp Gin 
50 55 60 

aca ttt tta tat acg gga tta ttt att aca tct cat gat gec atg cat 240 
Thr Phe Leu Tyr Thr Gly Leu Phe lie Thr Ser His Asp Ala Met His 
65 70 75 80 

ggc gta gta ttt ccc caa aac .acc aag att aat cat ttg att gga aca 288 
Gly Val Val Phe Pro Gin Asn Thr Lys lie Asn His Leu lie Gly Thr 
85 90 95 

ttg acc eta tec ctt tat ggt ctt tta cca tat caa aaa eta ttg aaa 336 
Leu Thr Leu Ser Leu -Tyr Gly Leu Leu Pro Tyr Gin Lys Leu Leu Lys 
100 105 110 

aaa cat tgg tta cac cac cac aat cca gca age tea ata gac ccg gat 384 
Lys His Trp Leu His His His Asn Pro Ala Ser Ser lie Asp Pro Asp 
115 120 125 

ttt cac aat ggt aaa cac caa agt ttc ttt get tgg tat ttt cat ttt 432 
Phe His Asn Gly Lys His Gin Ser Phe Phe Ala Trp Tyr Phe His Phe 
130 135 140 

atg aaa ggt tac tgg agt tgg ggg caa ata att gcg ttg act att att 480 
Met Lys Gly Tyr Trp Ser Trp Gly Gin lie lie Ala Leu Thr lie lie 
145 150 155 160 



tat aac ttt get aaa tac ata etc cat ate cca agt gat aat eta act 528 
Tyr Asn Phe Ala Lys Tyr lie Leu His lie Pro Ser Asp Asn Leu Thr 
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165 170 175 

tac ttt tgg gtg eta ccc teg ctt tta agt tea tta caa tta ttc tat 576 

Tyr Phe Trp Val Leu Pro Ser Leu Leu Ser Ser Leu Gin Leu Phe Tyr 
180 185 190 

ttt ggt act ttt tta ccc cat agt gaa cca ata ggg ggt tat gtt cag 624 

Phe Gly Thr Phe Leu Pro His Ser Glu Pro He Gly Gly Tyr Val Gin 
195 200 205 

cct cat tgt gee caa aca att age cgt cct att tgg tgg tea ttt ate 672 

Pro His Cys Ala Gin Thr He Ser Arg Pro He Trp Trp Ser Phe He 

210 215 220 

acg tgc tat cat ttt ggc tac cac gag gaa cat cac gaa tat cct cat 720 

Thr Cys Tyr His Phe Gly Tyr His Glu Glu His His Glu Tyr Pro His 
225 230 235 240 

att tct tgg tgg cag tta cca gaa att tac aaa gca aaa tag 762 

He Ser Trp Trp Gin Leu Pro Glu He Tyr Lys Ala Lys 
245 250 



<210> 30 
<211> 253 

<212> PRT 

<213> Nostoc punctiforme 
<400> 30 

Val He Gin Leu Glu Gin Pro Leu Ser His Gin Ala Lys Leu Thr Pro 
15 10 15 



Val Leu Arg Ser Lys Ser Gin Phe Lys Gly Leu Phe He Ala He Val 
20 25 30 



lie Val Ser Ala Trp Val He Ser Leu Ser Leu Leu Leu Ser Leu Asp 
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35 



lie Ser Lys Leu 
50 



Thr Phe Leu Tyr 
65 



Gly Val Val Phe 



Leu Thr Leu Ser 
100 



Lys His Trp Leu 
115 



Phe His Asn Gly 
130 



Met Lys Gly Tyr 
145 



Tyr Asn Phe Ala 



Tyr Phe Trp Val 
180 



40 



Lys Phe Trp Met 
55 



Thr Gly Leu Phe 
70 



Pro Gin Asn Thr 
85 



Leu Tyr Gly Leu 



His His His Asn 
120 



Lys His Gin Ser 
135 



Trp Ser Trp Gly 
150 



Lys Tyr lie Leu 
165 



Leu Pro Ser Leu 



Leu Leu Pro Val 
60 



lie Thr Ser His 
75 



Lys lie Asn His 
90 



Leu Pro Tyr Gin 
105 



Pro Ala Ser Ser 



Phe Phe Ala Trp 
140 



Gin lie He Ala 
155 



His He Pro Ser 
170 



Leu Ser Ser Leu 
185 



45 



He Leu Trp Gin 



Asp Ala Met His 
80 



Leu He Gly Thr 
95 



Lys Leu Leu Lys 
110 



He Asp Pro Asp 
125 



Tyr Phe His Phe 



Leu Thr He He 
160 



Asp Asn Leu Thr 
175 



Gin Leu Phe Tyr 
190 



Phe Gly Thr Phe Leu Pro His Ser Glu Pro He Gly Gly Tyr Val Gin 
195 200 205 
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Pro His Cys Ala Gin Thr lie Ser Arg Pro lie Trp Trp Ser Phe lie 
210 215 220 



Thr Cys Tyr His Phe Gly Tyr His Glu Glu His His Glu Tyr Pro His 
225 230 235 240 



lie Ser Trp Trp Gin Leu Pro Glu lie Tyr Lys Ala Lys 
245 250 



<210> 31 

<211> 1608 

<212> DNA 

<213> Haematococcus pluvialis 



<220> 

<221> CDS 

<222> (3) . . (971) 

<400> 31 

ct aca ttt cac aag ccc gtg age ggt gca age get ctg ccc cac ate 47 

Thr Phe His Lys Pro Val Ser Gly Ala Ser Ala Leu Pro His lie 
15 10 15 

ggc cca cct cct cat etc cat egg tea ttt get get ace acg atg ctg 95 
Gly Pro Pro Pro His Leu His Arg Ser Phe Ala Ala Thr Thr Met Leu 
20 25 30 

teg aag ctg cag tea ate age gtc aag gee cgc cgc gtt gaa eta gee 143 
Ser Lys Leu Gin Ser lie Ser Val Lys Ala Arg Arg Val Glu Leu Ala 
35 40 45 

cgc gac ate acg egg ccc aaa gtc tgc ctg cat get cag egg tgc teg 191 
Arg Asp lie Thr Arg Pro Lys Val Cys Leu His Ala Gin Arg Cys Ser 
50 55 60 
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tta gtt egg 
Leu Val Arg 
65 

acc gtg cag 
Thr Val Gin 
80 

etc cag cag 
Leu Gin Gin 



egg gag cag 
Arg Glu Gin 



gtg tea ggc 
Val Ser Gly 
130 

atg acc gtg 
Met Thr Val 
145 

etc ttg gtg 
Leu Leu Val 
160 

gca cac aaa 
Ala His Lys 

aag age cac 
Lys Ser His 

ttt gca ate 
Phe Ala He 
210 

ttc tgg ctg 



ctg cga gtg 
Leu Arg Val 

get gee ggc 
Ala Ala Gly 
85 

ctt gac egg 
Leu Asp Arg 
100 

ctg tea tac 
Leu Ser Tyr 
115 

att gee ate 
He Ala He 



ggc ggc gca 
Gly Gly Ala 



gtt ggt "ggc 
Val Gly Gly 
165 

gee ate tgg 
Ala He Trp 
180 

cac aca cct 
His Thr Pro 
195 

ate aat gga 
He Asn Gly 

ccc aac gtc 



gca gca cca 
Ala Ala Pro 
70 

gcg ggc gat 
Ala Gly Asp 



get ate gca 
Ala He Ala 



cag get gee 
Gin Ala Ala 
120 

ttc gee acc 
Phe Ala Thr 
135 

gtg cca tgg 
Val Pro Trp 
150 

gcg etc ggc 
Ala Leu Gly 

cat gag teg 
His Glu Ser 

cgc act gga 
Arg -Thr Gly 
200 

ctg ccc gee 
Leu Pro Ala 
215 

ctg ggg gcg 



cag aca gag 
Gin Thr Glu 
75 

gag cac age 
Glu His Ser 
90 

gag cgt cgt 
Glu Arg Arg 
105 

gee att gca 
Ala He Ala 

tac ctg aga 
Tyr Leu Arg 

ggt gaa gtg 
Gly Glu Val 
155 

atg gag atg 
Met Glu Met 
170 

cct ctg ggc 
Pro Leu Gly 
185 

ccc ttt gaa 
Pro Phe Glu 

atg etc ctg 
Met Leu Leu 

gee tgc ttt 



gag gcg ctg 
Glu Ala Leu 



gee gat gta 
Ala Asp Val 



gee egg cgc 
Ala Arg Arg 
110 

gca tea att 
Ala Ser He 
125 

ttt gee atg 
Phe Ala Met 
140 

get ggc act 
Ala Gly Thr 

tat gee cgc 
Tyr Ala Arg 

tgg ctg ctg 
Trp Leu Leu 
190 

gee aac gac 
Ala Asn Asp 
205 

tgt acc ttt 

Cys Thr Phe 
220 

gga gcg ggg 



gga 239 
Gly 

gca 287 

Ala 

95 

aaa 335 
Lys 

ggc 383 
Gly 



cac 431 
His 

etc 479 
Leu 

tat 527 

Tyr 

175 

cac 575 
His 



ttg 623 
Leu 



ggc 671 
Gly 

ctg 719 



i 
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Phe Trp Leu Pro Asn Val Leu Gly Ala Ala Cys Phe Gly Ala Gly Leu 

225 230 235 



ggc ate acg eta tac ggc atg gca tat atg ttt gta cac gat ggc ctg 767 
Gly lie Thr Leu Tyr Gly Met Ala Tyr Met Phe Val His Asp Gly Leu 
240 245 250 255 



gtg cac agg cgc ttt ccc acc ggg ccc ate get ggc ctg ccc tac atg 815 
Val His Arg Arg Phe Pro Thr Gly Pro lie Ala Gly Leu Pro Tyr Met 
260 265 270 

aag cgc ctg aca gtg gee cac cag eta cac cac age ggc aag tac ggt 863 
Lys Arg Leu Thr Val Ala His Gin Leu His His Ser Gly Lys Tyr Gly 
275 280 285 



ggc gcg ccc tgg ggt atg ttc ttg ggt cca cag gag ctg cag cac att 
Gly Ala Pro Trp Gly Met Phe Leu Gly Pro Gin Glu Leu Gin His lie 
290 295 300 



911 



cca ggt gcg gcg gag gag gtg gag cga ctg gtc ctg gaa ctg gac tgg 959 
Pro Gly Ala* Ala Glu Glu Val Glu Arg Leu Val Leu Glu Leu Asp Trp 
305 310 315 



tec aag egg tag ggtgcggaac caggcacgct ggtttcacac ctcatgcctg 1011 

Ser Lys Arg 

320 

tgataaggtg tggctagagc gatgcgtgtg agaegggtat gtcaeggteg actggtctga 1071 

tggccaatgg catcggccat gtctggtcat caegggctgg ttgcctgggt gaaggtgatg 1131 

cacatcatca tgtgcggttg gaggggctgg cacagtgtgg gctgaactgg agcagttgtc 1191 

caggctggcg ttgaatcagt gagggtttgt gattggcggt tgtgaagcaa tgactccgcc 1251 

catattctat ttgtgggagc tgagatgatg geatgettgg gatgtgcatg gatcatggta 1311 

gtgcagcaaa ctatattcac ctagggctgt tggtaggatc aggtgaggee ttgcacattg 1371 



catgatgtac tcgtcatggt gtgttggtga gaggatggat gtggatggat gtgtattctc 1431 
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agacgtagac cttgactgga ggcttgatcg agagagtggg ccgtattctt tgagagggga 14 91 
ggctcgtgcc agaaatggtg agtggatgac tgtgacgctg tacattgcag gcaggtgaga 1551 
tgcactgtct cgattgtaaa atacattcag atgcaaaaaa aaaaaaaaaa aaaaaaa 1608 



<210> 32 
<211> 322 
<212> PRT 

<213> Haematococcus pluvialis 
<400> 32 

Thr Phe His Lys Pro Val Ser Gly Ala Ser Ala Leu Pro His lie Gly 
15 10 15 



Pro Pro Pro His Leu His Arg Ser Phe Ala Ala Thr Thr Met Leu Ser 
20 25 30 



Lys Leu Gin Ser lie Ser Val Lys Ala Arg Arg Val Glu Leu Ala Arg 
35 40 45 



Asp lie Thr Arg Pro Lys Val Cys Leu His Ala Gin Arg Cys Ser Leu 
50 55 60 



Val Arg Leu Arg Val Ala Ala Pro Gin Thr Glu Glu Ala Leu Gly Thr 
65 70 75 80 



Val Gin Ala Ala Gly Ala Gly Asp Glu His Ser Ala Asp Val Ala Leu 
85 90 95 



Gin Gin Leu Asp Arg Ala lie Ala Glu Arg Arg Ala Arg Arg Lys Arg 
100 105 110 
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Glu Gin Leu Ser Tyr Gin Ala Ala Ala lie Ala Ala Ser lie Gly Val 
115 120 125 



Ser Gly lie Ala lie Phe Ala Thr Tyr Leu Arg Phe Ala Met His Met 
130 135 140 



Thr Val Gly Gly Ala Val Pro Trp Gly Glu Val Ala Gly Thr Leu Leu 
145 150 155 160 



Leu Val Val Gly Gly Ala Leu Gly Met Glu Met Tyr Ala Arg Tyr Ala 
165 170 175 



His Lys Ala lie Trp His Glu Ser Pro Leu Gly Trp Leu Leu His Lys 
180 185 190 



Ser His His Thr Pro Arg Thr Gly Pro Phe Glu Ala Asn Asp Leu Phe 
195 2C0 205 



Ala lie lie Asn Gly Leu Pro Ala Met Leu Leu Cys Thr Phe Gly Phe 
210 215 220 



Trp Leu Pro Asn Val Leu Gly Ala Ala Cys Phe Gly Ala Gly Leu Gly 
225 230 235 240 



lie Thr Leu Tyr Gly Met Ala Tyr Met Phe Val His Asp Gly Leu Val 
245 250 255 



His Arg Arg Phe Pro Thr Gly Pro lie Ala Gly Leu Pro Tyr Met Lys 
260 265 270 
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Arg Leu Thr Val Ala His Gin Leu His His Ser Gly Lys Tyr Gly Gly 
275 280 285 



Ala Pro Trp Gly Met Phe Leu Gly Pro Gin Glu Leu Gin His lie Pro 
290 295 300 



Gly Ala Ala Glu Glu Val Glu Arg Leu Val Leu Glu Leu Asp Trp Ser 
305 310 315 320 



Lys Arg 



<210> 33 

<211> 528 

<212> DNA 

<213> Erwinia uredovora 



<220> 

<221> CDS 

<222> (1) . . (528) 



<400> 33 
atg ttg tgg 
Met Leu Trp 
1 

atg gaa gtg 
Met Glu Val 



ggt tgg gga 
Gly Trp Gly 
35 

gaa gtt aac 
Glu Val Asn 



att tgg aat 
lie Trp Asn 
5 

att get gca 
lie Ala Ala 
20 

tgg cat ctt 
Trp His Leu 

gat ctt tat 
Asp Leu Tyr 



gec ctg ate 
Ala Leu lie 

ctg gca cac 
Leu Ala His 
25 

tea cat cat 
Ser His His 
40 

gec gtg gtt 
Ala Val Val 



gtt ttc gtt 
Val Phe Val 
10 

aaa tac ate 
Lys Tyr lie 

gaa ccg cgt 
Glu Pro Arg 

ttt get gca 
Phe Ala Ala 



ace gtg att 
Thr Val lie 
15 

atg cac ggc 
Met His Gly 
30 

aaa ggt gcg 
Lys Gly Ala 
45 

tta teg ate 
Leu Ser lie 



ggc 4 8 
Gly 

tgg 96 
Trp 

ttt 144 
Phe 



ctg 192 
Leu 



BASF AG 80/365 January 08, 2004 

BASF NAE 877/03 

50 55 60 

ctg att tat ctg ggc agt aca gga atg tgg ccg etc cag tgg att ggc 240 

Leu lie Tyr Leu Gly Ser Thr Gly Met Trp Pro Leu Gin Trp lie Gly 

65 70 75 80 



gca ggt atg 
Ala Gly Met 



ctg gtg cat 
Leu Val His 

etc aaa egg 
Leu Lys Arg 
115 

aaa gaa ggt 
Lys Glu Gly 
130 



acg gcg tat 
Thr Ala Tyr 
85 

caa cgt tgg 
Gin Arg Trp 
100 

ttg tat atg 
Leu Tyr Met 

tgt gtt tct 
Cys Val Ser 



gga tta etc 
Gly Leu Leu 

cca ttc cgc 
Pro Phe Arg 
105 

gcg cac cgt 
Ala His Arg 
120 

ttt ggc ttc 
Phe Gly Phe 
135 



tat ttt atg 
Tyr Phe Met 
90 

tat att cca 
Tyr lie Pro 

atg cat cac 
Met His His 



etc tat gcg 
Leu Tyr Ala 
140 



gtg cac gac 
Val His Asp 
95 

cgc aag ggc 
Arg Lys Gly 
110 

gee gtc agg 
Ala Val Arg 
125 

ccg ccc ctg 
Pro Pro Leu 



ggg 288 
Gly 

tac 336 
Tyr 

ggc 384 
Gly 

tea 432 
Ser 



aaa ctt cag gcg acg etc egg gaa aga cat ggc get aga gcg ggc get 480 
Lys Leu Gin Ala Thr Leu Arg Glu Arg His Gly Ala Arg Ala Gly Ala 
145 -150 155 160 



gee aga gat gcg cag ggc ggg gag gat gag ccc gca tec ggg aag taa 528 
Ala Arg Asp Ala Gin Gly Gly Glu Asp Glu Pro Ala Ser Gly Lys 
165 170 175 



<210> 34 

<211> 175 

<212> PRT 

<213> Erwinia uredovora 

<400> 34 



Met Leu Trp lie Trp Asn Ala Leu He Val Phe Val Thr Val He Gly 
15 10 15 
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Met Glu Val He 
20 



Gly Trp Gly Trp 
35 



Glu Val Asn Asp 
50 



Leu He Tyr Leu 
65 



Ala Gly Met Thr 



Leu Val His Gin 
100 



Leu Lys Arg Leu 
115 



Lys Glu Gly Cys 
130 



Lys Leu Gin Ala 
145 



Ala Arg Asp Ala 



Ala Ala Leu Ala 



His Leu Ser His 
40 



Leu Tyr Ala Val 
55 



Gly Ser Thr Gly 
70 



Ala Tyr Gly Leu 
85 



Arg Trp Pro Phe 



Tyr Met Ala His 
120 



Val Ser Phe Gly 
135 



Thr Leu Arg Glu 
150 



Gin Gly Gly Glu 
165 



His Lys Tyr He 
25 



His Glu Pro Arg 



Val Phe Ala Ala 
60 



Met Trp Pro Leu 
75 



Leu Tyr Phe Met 
90 



Arg Tyr He Pro 
105 



Arg Met His His 



Phe Leu Tyr Ala 

• 140 



Arg His Gly Ala 
155 



Asp Glu Pro Ala 
170 



Met His Gly Trp 
30 



Lys Gly Ala Phe 
45 



Leu Ser He Leu 



Gin Trp He Gly 
80 



Val His Asp Gly 
95 



Arg Lys Gly Tyr 
110 



Ala Val Arg Gly 
125 



Pro Pro Leu Ser 



Arg Ala Gly Ala 
160 



Ser Gly Lys 
175 
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<210> 35 
<211> 1520 
<212> DNA 
<213> Artificial 

<220> 

<223> Promoter 
<400> 35 

ctcgagtacc gaggcggaac ggcaggaatg tttccctctc ttttagaggg caattcttta 60 

tccaatgtca tgttgatgct agatatttct gtctcttata ataaggcgaa tacccatttt 120 

tgaattgaag ttgagataaa aaaaaagggg gcccaatttg tcaacgccaa agagtcaagc 180 

tttttctttg gctttagccg aacaatctaa gacttattgt ttttgaagat atttgacctt 240 

ttctagatat tccttcaagt aaagcttttt tcgagttttt tttttttttc tttgtgaagg 300 

atttattgtt -attggtatcc attttttatt ggaagacaag ataagttaat attgattttg 360 
cttaaagatt aaaaggaaat cagaaaacga caataaaaaa tgtaacggac aaactatggt . 420 

gtcgattata agtctaaatc cttaaaaaat gacaacgagt tgctttcctc tgaaaacaat 480 

tcttttgtct ttgcaagaaa ggtttctttt ttgtttgctt gcattactta aacatcaaat 540 

caaatgaaag gaataaagca gatttgaggg cgaataagga ttttctggtc aacaagatgt 600 

gagtgacacc taaggaacta aatgccattc atttgtttta aaacgacatc aaagattgat 660 

gatcaacagg attgagagag agaaaaagaa ctcgtgtcat ttatttctgt tgactgaaat 720 

tttatattta gaaaaaatgt caaatctata gctttagcta tattacataa catttgaaat 780 

aataataata aaaaaagaca cattagagac acttttcaaa ctctaaataa ctgtctataa 840 

acacaaagaa aacaaagacc tctataacaa cttattagat ttttctcgta cttttgtcta 900 

aagatgatgt attcttgtta tcccacactt ctttcatttg ttcttgatgc tactaaatat 960 
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acaaaatttc ttttttgcaa gagatattat tccaaaaatt ttcaaaaaga aatttttttc 1020 

acaatagcag ttgatcgtgt aacccaaaga ggttctttgt tattttgcac ttccgctttg 1080 

cggtgatgca tattcaaagt aatatatgga ataaacaacg tgtttaagca tgaaagaaag 1140 

gaaacaaagg ccgctttgaa caaatgcata atatttcaga caaaaatgat ctaaagcaag 1200 

cagtaaatca aacaagaaac attgctgatt cgcgttagaa aacgataaaa gtctaataag 1260 

ccactaagta tacttcaatg aactttttgt atgcttatgg tccaatcaga ccaataattt 1320 

gtgaccattc ctgaggtggc tttggtgatg cggaaacaga aaaaaatttt ctcaccaatc 1380 

gatttaaaaa acaatttctg ctttgaacca aaactttttt tttctcttta atcattaact 1440 

ttatcaagta tgtacctacc ctcaaagtcc tcactcaagc acaattatgc taacattgtt 1500 

ccaccttctc tttagaaatg 1520 



<210> 36 

<211> 16245 

<212> DNA 

<213> Artificial 

<220> 

<223> Plasmid 



<220> 

<221> misc_f eature 

<222> (10264) . . (10264) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (10472) . . (10472) 

<223> n is a, c, g, or t 
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<220> 

<221> misc_f eature 
<222> (10563) . . (10563) 
<223> n is a, c, g, or t 

<400> 36 

ccgggctggt tgccctcgcc gctgggctgg cggccgtcta tggccctgca aacgcgccag 60 

aaacgccgtc gaagccgtgt gcgagacacc gcggccgccg gcgttgtgga tacctcgcgg 120 

aaaacttggc cctcactgac agatgagggg cggacgttga cacttgaggg gccgactcac 180 

ccggcgcggc gttgacagat gaggggcagg ctcgatttcg gccggcgacg tggagctggc 240 

cagcctcgca aatcggcgaa aacgcctgat tttacgcgag tttcccacag atgatgtgga 300 

caagcctggg gataagtgcc ctgcggtatt gacacttgag gggcgcgact actgacagat 360 

gaggggcgcg atccttgaca cttgaggggc agagtgctga cagatgaggg gcgcacctat 420 

tgacatttga ggggctgtcc acaggcagaa aatccagcat ttgcaagggt ttccgcccgt 480 

ttttcggcca ccgctaacct gtcttttaac ctgcttttaa accaatattt ataaaccttg 540 

tttttaacca gggctgcgcc ctgtgcgcgt gaccgcgcac gccgaagggg ggtgcccccc 600 

cttctcgaac cctcccggcc cgctaacgcg ggcctcccat ccccccaggg gctgcgcccc 660 

tcggccgcga acggcctcac cccaaaaatg gcagcgctgg cagtccttgc cattgccggg 720 

atcggggcag taacgggatg ggcgatcagc ccgagcgcga cgcccggaag cattgacgtg 780 

ccgcaggtgc tggcatcgac attcagcgac caggtgccgg gcagtgaggg cggcggcctg 840 

ggtggcggcc tgcccttcac ttcggccgtc ggggcattca cggacttcat ggcggggccg 900 

gcaattttta ccttgggcat tcttggcata gtggtcgcgg gtgccgtgct cgtgttcggg 960 

ggtgcgataa acccagcgaa ccatttgagg tgataggtaa gattataccg aggtatgaaa 1020 



acgagaattg gacctttaca gaattactct atgaagcgcc atatttaaaa agctaccaag 1080 
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acgaagagga tgaagaggat gaggaggcag attgccttga atatattgac aatactgata 1140 

agataatata tcttttatat agaagatatc gccgtatgta aggatttcag ggggcaaggc 1200 

ataggcagcg cgcttatcaa tatatctata gaatgggcaa agcataaaaa cttgcatgga 1260 

ctaatgcttg aaacccagga caataacctt atagcttgta aattctatca taattgggta 1320 

atgactccaa cttattgata gtgttttatg ttcagataat gcccgatgac tttgtcatgc 1380 

agctccaccg attttgagaa cgacagcgac ttccgtccca gccgtgccag gtgctgcctc 1440 

agattcaggt tatgccgctc aattcgctgc gtatatcgct tgctgattac gtgcagcttt 1500 

cccttcaggc gggattcata cagcggccag ccatccgtca tccatatcac cacgtcaaag 1560 

ggtgacagca ggctcataag acgccccagc gtcgccatag tgcgttcacc gaatacgtgc 1620 

gcaacaaccg tcttccggag actgtcatac gcgtaaaaca gccagcgctg gcgcgattta 1680 

gccccgacat agccccactg ttcgtccatt tccgcgcaga cgatgacgtc actgcccggc 1740 

tgtatgcgcg aggttaccga ctgcggcctg agttttttaa gtgacgtaaa atcgtgttga 1800 

ggccaacgcc cataatgcgg gctgttgccc ggcatccaac gccattcatg gccatatcaa 1860 

tgattttctg gtgcgtaccg ggttgagaag cggtgtaagt gaactgcagt tgccatgttt 1920 

tacggcagtg agagcagaga tagcgctgat gtccggcggt gcttttgccg ttacgcacca 1980 

ccccgtcagt agctgaacag gagggacagc tgatagacac agaagccact ggagcacctc 2040 

aaaaacacca tcatacacta aatcagtaag ttggcagcat cacccataat tgtggtttca 2100 

aaatcggctc cgtcgatact atgttatacg ccaactttga aaacaacttt gaaaaagctg 2160 

ttttctggta tttaaggttt tagaatgcaa ggaacagtga attggagttc gtcttgttat 2220 

aattagcttc ttggggtatc tttaaatact gtagaaaaga ggaaggaaat aataaatggc 2280 
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taaaatgaga atatcaccgg aattgaaaaa actgatcgaa aaataccgct gcgtaaaaga 
tacggaagga atgtctcctg ctaaggtata taagctggtg ggagaaaatg aaaacctata 
tttaaaaatg acggacagcc ggtataaagg gaccacctat gatgtggaac gggaaaagga 
catgatgcta tggctggaag gaaagctgcc tgttccaaag gtcctgcact ttgaacggca 
tgatggctgg agcaatctgc tcatgagtga ggccgatggc gtcctttgct cggaagagta 
tgaagatgaa caaagccctg aaaagattat cgagctgtat gcggagtgca tcaggctctt 
tcactccatc gacatatcgg attgtcccta tacgaatagc ttagacagcc gcttagccga 
attggattac ttactgaata acgatctggc cgatgtggat tgcgaaaact gggaagaaga 
cactccattt aaagatccgc gcgagctgta tgatttttta aagacggaaa agcccgaaga 
ggaacttgtc ttttcccacg gcgacctggg agacagcaac atctttgtga aagatggcaa 
agtaagtggc tttattgatc ttgggagaag cggcagggcg gacaagtggt atgacattgc 
cttctgcgtc cggtcgatca gggaggatat cggggaagaa cagtatgtcg agctattttt 
tgacttactg gggatcaa^c ctgattggga gaaaataaaa tattatattt tactggatga 
attgttttag tacctagatg tggcgcaacg atgccggcga caagcaggag cgcaccgact 
tcttccgcat caagtgtttt ggctctcagg ccgaggccca cggcaagtat ttgggcaagg 
ggtcgctggt attcgtgcag ggcaagattc ggaataccaa gtacgagaag gacggccaga 
cggtctacgg gaccgacttc attgccgata aggtggatta tctggacacc aaggcaccag 
gcgggtcaaa tcaggaataa gggcacattg ccccggcgtg agtcggggca atcccgcaag 
gagggtgaat gaatcggacg tttgaccgga aggcatacag gcaagaactg atcgacgcgg 
ggttttccgc cgaggatgcc gaaaccatcg caagccgcac cgtcatgcgt gcgccccgcg 
aaaccttcca gtccgtcggc tcgatggtcc agcaagctac ggccaagatc gagcgcgaca 



2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
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gcgtgcaact ggctccccct gccctgcccg cgccatcggc cgccgtggag cgttcgcgtc 3600 

gtctcgaaca ggaggcggca ggtttggcga agtcgatgac catcgacacg cgaggaacta 3660 

tgacgaccaa gaagcgaaaa accgccggcg aggacctggc aaaacaggtc agcgaggcca 3720 

agcaggccgc gttgctgaaa cacacgaagc agcagatcaa ggaaatgcag ctttccttgt 3780 

tcgatattgc gccgtggccg gacacgatgc gagcgatgcc aaacgacacg gcccgctctg 3840 

ccctgttcac cacgcgcaac aagaaaatcc cgcgcgaggc gctgcaaaac aaggtcattt 3900 

tccacgtcaa caaggacgtg aagatcacct acaccggcgt cgagctgcgg gccgacgatg 3960 

acgaactggt gtggcagcag gtgttggagt acgcgaagcg cacccctatc ggcgagccga 4020 

tcaccttcac gttctacgag ctttgccagg acctgggctg gtcgatcaat ggccggtatt 4080 
acacgaaggc cgaggaatgc ctgtcgcgcc tacaggcgac ggcgatgggc ttcacgtccg ■ 4140 

accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc ctggaccgtg 4200 

gcaagaaaac gtcccgttgc caggtcctga tcgacgagga aatcgtcgtg ctgtttgctg 4260 

gcgaccacta cacgaaattc atatgggaga agtaccgcaa gctgtcgccg acggcccgac 4320 

ggatgttcga ctatttcagc tcgcaccggg agccgtaccc gctcaagctg gaaaccttcc 4380 

gcctcatgtg cggatcggat tccacccgcg tgaagaagtg gcgcgagcag gtcggcgaag 4440 

cctgcgaaga gttgcgaggc agcggcctgg tggaacacgc ctgggtcaat gatgacctgg 4500 

tgcattgcaa acgctagggc cttgtggggt cagttccggc tgggggttca gcagccagcg 4560 

ctttactggc atttcaggaa caagcgggca ctgctcgacg cacttgcttc gctcagtatc 4620 

gctcgggacg cacggcgcgc tctacgaact gccgataaac agaggattaa aattgacaat 4680 

tgtgattaag gctcagattc gacggcttgg agcggccgac gtgcaggatt tccgcgagat 4740 
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ccgattgtcg gccctgaaga aagctccaga gatgttcggg tccgtttacg agcacgagga 4800 

gaaaaagccc atggaggcgt tcgctgaacg gttgcgagat gccgtggcat tcggcgccta 4860 

catcgacggc gagatcattg ggctgtcggt cttcaaacag gaggacggcc ccaaggacgc 4 920 

tcacaaggcg catctgtccg gcgttttcgt ggagcccgaa cagcgaggcc gaggggtcgc 4 980 

cggtatgctg ctgcgggcgt tgccggcggg tttattgctc gtgatgatcg tccgacagat 5040 

tccaacggga atctggtgga tgcgcatctt catcctcggc gcacttaata tttcgctatt 5100 

ctggagcttg ttgtttattt cggtctaccg cctgccgggc ggggtcgcgg cgacggtagg 5160 

cgctgtgcag ccgctgatgg tcgtgttcat ctctgccgct ctgctaggta gcccgatacg 5220 

attgatggcg gtcctggggg ctatttgcgg aactgcgggc gtggcgctgt tggtgttgac 5280 

accaaacgca gcgctagatc ctgtcggcgt cgcagcgggc ctggcggggg cggtttccat 5340 

ggcgttcgga accgtgctga cccgcaagtg gcaacctccc gtgcctctgc tcacctttac 5400 

cgcctggcaa ctggcggccg gaggacttct gctcgttcca gtagctttag tgtttgatcc 54 60 

gccaatcccg atgcctaca-g gaaccaatgt tctcggcctg gcgtggctcg gcctgatcgg 5520 

agcgggttta acctacttcc tttggttccg ggggatctcg cgactcgaac ctacagttgt 5580 

ttccttactg ggctttctca gccccagatc tggggtcgat cagccgggga tgcatcaggc 5640 

cgacagtcgg aacttcgggt ccccgacctg taccattcgg tgagcaatgg ataggggagt 5700 

tgatatcgtc aacgttcact tctaaagaaa tagcgccact cagcttcctc agcggcttta 5760 

tccagcgatt tcctattatg tcggcatagt tctcaagatc gacagcctgt cacggttaag 5820 

cgagaaatga ataagaaggc tgataattcg gatctctgcg agggagatga tatttgatca 5880 

caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga gatcatccgt 5940 

gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac atgagcaaag 6000 
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tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg ctgcctgtat 6060 

cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct ggtggcagga 6120 

tatattgtgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg cggacgtttt 6180 

taatgtactg gggtggtttt tcttttcacc agtgagacgg gcaacagctg attgcccttc 6240 

accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtttgccc cagcaggcga 6300 

aaatcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca aaagaatagc 6360 

ccgagatagg gttgagtgtt gttccagttt ggaacaagag tccactatta aagaacgtgg 6420 

actccaacgt caaagggcga aaaaccgtct atcagggcga tggcccacta cgtgaaccat 6480 

cacccaaatc aagttttttg gggtcgaggt gccgtaaagc actaaatcgg aaccctaaag 6540 

ggagcccccg atttagagct tgacggggaa agccggcgaa cgtggcgaga aaggaaggga 6600 

agaaagcgaa aggagcgggc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg 6660 

cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt 6720 

tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattcgag 6780 

ctcggtaccc ggggatcttt cgacactgaa atacgtcgag cctgctccgc ttggaagcgg 6840 

cgaggagcct cgtcctgtca caactaccaa catggagtac gataagggcc agttccgcca 6900 

gctcattaag agccagttca tgggcgttgg catgatggcc gtcatgcatc tgtacttcaa 6960 

gtacaccaac gctcttctga tccagtcgat catccgctga aggcgctttc gaatctggtt 7020 

aagatccacg tcttcgggaa gccagcgact ggtgacctcc agcgtccctt taaggctgcc 7080 

aacagctttc tcagccaggg ccagcccaag accgacaagg cctccctcca gaacgccgag 7140 

aagaactgga ggggtggtgt caaggaggag taagctcctt attgaagtcg gaggacggag 7200 
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cggtgtcaag aggatattct tcgactctgt attatagata agatgatgag gaattggagg 7260 

tagcatagct tcatttggat ttgctttcca ggctgagact ctagcttgga gcatagaggg 7320 

tcctttggct ttcaatattc tcaagtatct cgagtttgaa cttat.tccct gtgaaccttt 7380 

tattcaccaa tgagcattgg aatgaacatg aatctgagga ctgcaatcgc catgaggttt 7440 

tcgaaataca tccggatgtc gaaggcttgg ggcacctgcg ttggttgaat ttagaacgtg 7500 

gcactattga tcatccgata gctctgcaaa gggcgttgca caatgcaagt caaacgttgc 7560 

tagcagttcc aggtggaatg ttatgatgag cattgtatta aatcaggaga tatagcatga 7 620 

tctctagtta gctcaccaca aaagtcagac ggcgtaacca aaagtcacac aacacaagct 7680 

gtaaggattt cggcacggct acggaagacg gagaagccac cttcagtgga ctcgagtacc 7740 

atttaattct atttgtgttt gatcgagacc taatacagcc cctacaacga ccatcaaagt 7800 

cgtatagcta ccagtgagga agtggactca aatcgacttc agcaacatct cctggataaa 7860 

ctttaagcct aaactataca gaataagata ggtggagagc ttataccgag ctcccaaatc 7920 

tgtccagatc atggttgacc ggtgcctgga tcttcctata gaatcatcct tattcgttga 7980 

cctagctgat tctggagtga cccagagggt catgacttga gcctaaaatc cgccgcctcc 8040 

accatttgta gaaaaatgtg acgaactcgt gagctctgta cagtgaccgg tgactctttc 8100 

tggcatgcgg agagacggac ggacgcagag agaagggctg agtaataagc cactggccag 8160 

acagctctgg cggctctgag gtgcagtgga tgattattaa tccgggaccg gccgcccctc 8220 

cgccccgaag tggaaaggct ggtgtgcccc tcgttgacca agaatctatt gcatcatcgg 8280 

agaatatgga gcttcatcga atcaccggca gtaagcgaag gagaatgtga agccaggggt 8340 

gtatagccgt cggcgaaata gcatgccatt aacctaggta cagaagtcca attgcttccg 8400 

atctggtaaa agattcacga gatagtacct tctccgaagt aggtagagcg agtacccggc 84 60 
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gcgtaagctc cctaattggc ccatccggca tctgtagggc gtccaaatat cgtgcctctc 8520 

ctgctttgcc cggtgtatga aaccggaaag gccgctcagg agctggccag cggcgcagac 8580 

cgggaacaca agctggcagt cgacccatcc ggtgctctgc actcgacctg ctgaggtccc 8640 

tcagtccctg gtaggcagct ttgccccgtc tgtccgcccg gtgtgtcggc ggggttgaca 8700 

aggtcgttgc gtcagtccaa catttgttgc catattttcc tgctctcccc accagctgct 8760 

cttttctttt ctctttcttt tcccatcttc agtatattca tcttcccatc caagaacctt 8820 

tatttcccct aagtaagtac tttgctacat ccatactcca tccttcccat cccttattcc 8880 

tttgaacctt tcagttcgag ctttcccact tcatcgcagc ttgactaaca gctaccccgc 8940 

ttgagcagac atcaccatgc ctgaactcac cgcgacgtct gtcgagaagt ttctgatcga 9000 

aaagttcgac agcgtctccg acctgatgca gctctcggag ggcgaagaat ctcgtgcttt 9060 

cagcttcgat gtaggagggc gtggatatgt cctgcgggta aatagctgcg ccgatggttt 9120 

ctacaaagat cgttatgttt atcggcactt tgcatcggcc gcgctcccga ttccggaagt 9180 

gcttgacatt ggggaattca gcgagagcct gacctattgc atctcccgcc gtgcacaggg 9240 

tgtcacgttg caagacctgc ctgaaaccga actgcccgct gttctgcagc cggtcgcgga 9300 

ggccatggat gcgatcgctg cggccgatct tagccagacg agcgggttcg gcccattcgg 9360 

accgcaagga atcggtcaat acactacatg gcgtgatttc atatgcgcga ttgctgatcc 9420 

ccatgtgtat cactggcaaa ctgtgatgga cgacaccgtc agtgcgtccg tcgcgcaggc 9480 

tctcgatgag ctgatgcttt gggccgagga ctgccccgaa gtccggcacc tcgtgcacgc 9540 

ggatttcggc tccaacaatg tcctgacgga caatggccgc ataacagcgg tcattgactg 9600 

gagcgaggcg atgttcgggg attcccaata cgaggtcgcc aacatcttct tctggaggcc 9660 
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gtggttggct tgtatggagc agcagacgcg 
aggatcgccg cggctccggg cgtatatgct 
cttggttgac ggcaatttcg atgatgcagc 
ccgatccgga gccgggactg tcgggcgtac 
gaccgatggc tgtgtagaag tactcgccga 
gagggcaaag gaatagagta gatgccgacc 
tcatcaaaca gcttgacgaa tctggatata 
ttgagacaaa tggtgttcag gatctcgata 
gtgccttcta gtgatttaat agctccatgt 
cctcttccag atacagctca tctgcaatgc 
cttncaggct ccggcgaaga gaagaatagc 
gagatcaagc agatcaacgg tcgtcaagag 
tccacgcgac tatatatttg tctctaattg 
atagcttgac tatgaaaatt ccgtcaccag 
tcttccttga actctcaagc ctacaggaca 
canttcctac taagatggta tacaatagta 
taacacccaa tacgccggcc gaaacttttt 
atgcacaggt acacttgttt agaggtaatc 
gtgtaagcgc ccactccaca tctccactcg 
atgctccata gactcacatt gatattgtcg 
tacaaaagtt agcagagaag catgatttct 
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ctacttcgag cggaggcatc cggagcttgc 9720 

ccgcattggt cttgaccaac tctatcagag 9780 

ttgggcgcag ggtcgatgcg acgcaatcgt 9840 

acaaatcgcc cgcagaagcg cggccgtctg 9900 

tagtggaaac cgacgcccca gcactcgtcc 9960 

gcgggatcga tccacttaac gttactgaaa 10020 

agatcgttgg tgtcgatgtc agctccggag 10080 

agatacgttc atttgtccaa gcagcaaaga 10140 

caacaagaat aaaacgcgtt ttcgggttta 10200 

attaatgcat tgactgcaac ctagtaacgc 10260 

ttagcagagc tattttcatt ttcgggagac 10320 

acctacgaga ctgaggaatc cgctcttggc 10380 

tactttgaca tgctcctctt ctttactctg 10440 

cncctgggtt cgcaaagata attgcatgtt 10500 

cacattcatc gtaggtataa acctcgaaat 10560 

accatgcatg gttgcctagt gaatgctccg 10620 

tacaactctc ctatgagtcg tttacccaga 10680 

cttctttcta gctagaagtc ctcgtgtact 10740 

acctgcaggc atgcaagctt aatctataca 10800 

aagatttcga tgctgactta gtagagcaac 10860 

taatctttga agaccgcaag tttgcagata 10920 
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tcggtatgtg aattctatct attttttttc tgatgtgtgc atggatgact catgatcata 10980 

ttcttaggta atactgtcaa gcatcaatat ggcaagggcg tttacaagat tgcttcttgg 11040 

tctcatatta ctaatgctca cacagttcct ggagaaggta ttatcaaggg acttgccgaa 11100 

gtcggcctcc ctcttggtcg tggcttgctt ttgctagcag aaatgtcatc tcaaggtgca 11160 

ttaactaagg gtatttacac tgccgaatct gtcaatatgg ctcgccgcaa caaagatttc 11220 

gtttttggct ttattgcaca acacaaaatg aatcagtatg atgatgagga ttttgttgtc 11280 

atgtcgcctg aagcttggcg taatcatggt catagctgtt tcctgtgtga aattgttatc 11340 

cgctcacaat tccacacaac atacgagccg gaagcataaa gtgtaaagcc tggggtgcct 11400 

aatgagtgag ctaactcaca ttaattgcgt tgcgctcact gcccgctttc cagtcgggaa 114 60 

acctgtcgtg ccagctgcat taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta .11520 

ttgggccaaa gacaaaaggg cgacattcaa ccgattgagg gagggaaggt aaatattgac 11580 

ggaaattatt cattaaaggt gaattatcac cgtcaccgac ttgagccatt tgggaattag 11640 

agccagcaaa atcaccagta gcaccattac cattagcaag gccggaaacg tcaccaatga 11700 

aaccatcgat agcagcaccg taatcagtag cgacagaatc aagtttgcct ttagcgtcag 11760 

actgtagcgc gttttcatcg gcattttcgg tcatagcccc cttattagcg tttgccatct 11820 

tttcataatc aaaatcaccg gaaccagagc caccaccgga accgcctccc tcagagccgc 11880 

caccctcaga accgccaccc tcagagccac caccctcaga gccgccacca gaaccaccac 11940 

cagagccgcc gccagcattg acaggaggcc cgatctagta acatagatga caccgcgcgc 12000 

gataatttat cctagtttgc gcgctatatt ttgttttcta tcgcgtatta aatgtataat 12060 

tgcgggactc taatcataaa aacccatctc ataaataacg tcatgcatta catgttaatt 12120 
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attacatgct taacgtaatt caacagaaat tatatgataa tcatcgcaag accggcaaca 12180 

ggattcaatc ttaagaaact ttattgccaa atgtttgaac gatcggggat catccgggtc 12240 

tgtggcggga actccacgaa aatatccgaa cgcagcaaga tatcgcggtg catctcggtc 12300 

ttgcctgggc agtcgccgcc gacgccgttg atgtggacgc cgggcccgat catattgtcg 12360 

ctcaggatcg tggcgttgtg cttgtcggcc gttgctgtcg taatgatatc ggcaccttcg 12420 

accgcctgtt ccgcagagat cccgtgggcg aagaactcca gcatgagatc cccgcgctgg 12480 

aggatcatcc agccggcgtc ccggaaaacg attccgaagc ccaacctttc atagaaggcg 12540 

gcggtggaat cgaaatctcg tgatggcagg ttgggcgtcg cttggtcggt catttcgaac 12600 

cccagagtcc cgctcagaag aactcgtcaa gaaggcgata gaaggcgatg cgctgcgaat 12660 

cgggagcggc gataccgtaa agcacgagga agcggtcagc ccattcgccg ccaagctctt '12720 

cagcaatatc 'acgggtagcc aacgctatgt cctgatagcg gtccgccaca cccagccggc 12780 

cacagtcgat gaatccagaa aagcggccat tttccaccat gatattcggc aagcaggcat 12840 

cgccatgggt cacgacgaga tcatcgccgt cgggcatgcg cgccttgagc ctggcgaaca 12900 

gttcggctgg cgcgagcccc tgatgctctt cgtccagatc atcctgatcg acaagaccgg 12960 

cttccatccg agtacgtgct cgctcgatgc gatgtttcgc ttggtggtcg aatgggcagg 13020 

tagccggatc aagcgtatgc agccgccgca ttgcatcagc catgatggat actttctcgg 13080 

caggagcaag gtgagatgac aggagatcct gccccggcac ttcgcccaat agcagccagt 13140 

cccttcccgc ttcagtgaca acgtcgagca cagctgcgca aggaacgccc gtcgtggcca 13200 

gccacgatag ccgcgctgcc tcgtcctgca gttcattcag ggcaccggac aggtcggtct 13260 

tgacaaaaag aaccgggcgc ccctgcgctg acagccggaa cacggcggca tcagagcagc 13320 

cgattgtctg ttgtgcccag tcatagccga atagcctctc cacccaagcg gccggagaac 13380 
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ctgcgtgcaa tccatcttgt tcaatcatgc gaaacgatcc agatccggtg cagattattt 13440 

ggattgagag tgaatatgag actctaattg gataccgagg ggaatttatg gaacgtcagt 13500 

ggagcatttt tgacaagaaa tatttgctag ctgatagtga ccttaggcga cttttgaacg 13560 

cgcaataatg gtttctgacg tatgtgctta gctcattaaa ctccagaaac ccgcggctga 13620 

gtggctcctt caacgttgcg gttctgtcag ttccaaacgt aaaacggctt gtcccgcgtc 13680 

atcggcgggg gtcataacgt gactccctta attctccgct catgatcaga ttgtcgtttc 13740 

ccgccttcag tttaaactat cagtgtttga caggatatat tggcgggtaa acctaagaga 13800 

aaagagcgtt tattagaata atcggatatt taaaagggcg tgaaaaggtt tatccgttcg 13860 

tccatttgta tgtgcatgcc aaccacaggg ttccccagat ctggcgccgg ccagcgagac 13920 

gagcaagat.t ggccgccgcc cgaaacgatc cgacagcgcg cccagcacag gtgcgcaggc 13980 

aaattgcacc aacgcataca gcgccagcag aatgccatag tgggcggtga cgtcgttcga 14040 

gtgaaccaga tcgcgcagga ggcccggcag caccggcata atcaggccga tgccgacagc 14100 

gtcgagcgcg acagtgctca gaattacgat caggggtatg ttgggtttca cgtctggcct 14160 

ccggaccagc ctccgctggt ccgattgaac gcgcggattc tttatcactg ataagttggt 14220 

ggacatatta tgtttatcag tgataaagtg tcaagcatga caaagttgca gccgaataca 14280 

gtgatccgtg ccgccctgga cctgttgaac gaggtcggcg tagacggtct gacgacacgc 14340 

aaactggcgg aacggttggg ggttcagcag ccggcgcttt actggcactt caggaacaag 14400 

cgggcgctgc tcgacgcact ggccgaagcc atgctggcgg agaatcatac gcattcggtg 144 60 

ccgagagccg acgacgactg gcgctcattt ctgatcggga atgcccgcag cttcaggcag 14520 

gcgctgctcg cctaccgcga tggcgcgcgc atccatgccg gcacgcgacc gggcgcaccg 14580 
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cagatggaaa cggccgacgc gcagcttcgc ttcctctgcg aggcgggttt ttcggccggg 14640 

gacgccgtca atgcgctgat gacaatcagc tacttcactg ttggggccgt gcttgaggag 14700 

caggccggcg acagcgatgc cggcgagcgc ggcggcaccg ttgaacaggc tccgctctcg 14760 

ccgctgttgc gggccgcgat agacgccttc gacgaagccg gtccggacgc agcgttcgag 14820 

cagggactcg cggtgattgt cgatggattg gcgaaaagga ggctcgttgt caggaacgtt 14880 

gaaggaccga gaaagggtga cgattgatca ggaccgctgc cggagcgcaa cccactcact 14940 

acagcagagc catgtagaca acatcccctc cccctttcca ccgcgtcaga cgcccgtagc 15000 

agcccgctac gggctttttc atgccctgcc ctagcgtcca agcctcacgg ccgcgctcgg 15060 

cctctctggc ggccttctgg cgctcttccg cttcctcgct cactgactcg ctgcgctcgg 15120 

tcgttcggct gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag 15180 

aatcagggga taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc 15240 

gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca 15300 

aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt 15360 

ttccccctgg aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc 15420 

tgtccgcctt tctcccttcg ggaagcgtgg cgcttttccg ctgcataacc ctgcttcggg 15480 

gtcattatag cgattttttc ggtatatcca tcctttttcg cacgatatac aggattttgc 15540 

caaagggttc gtgtagactt tccttggtgt atccaacggc gtcagccggg caggataggt 15600 

gaagtaggcc cacccgcgag cgggtgttcc ttcttcactg tcccttattc gcacctggcg 15660 

gtgctcaacg ggaatcctgc tctgcgaggc tggccggcta ccgccggcgt aacagatgag 15720 

ggcaagcgga tggctgatga aaccaagcca accaggaagg gcagcccacc tatcaaggtg 15780 

tactgccttc cagacgaacg aagagcgatt gaggaaaagg cggcggcggc cggcatgagc 15840 
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ctgtcggcct acctgctggc cgtcggccag ggctacaaaa tcacgggcgt cgtggactat 15900 

gagcacgtcc gcgagctggc ccgcatcaat ggcgacctgg gccgcctggg cggcctgctg 15960 

aaactctggc tcaccgacga cccgcgcacg gcgcggttcg gtgatgccac gatcctcgcc 16020 

ctgctggcga agatcgaaga gaagcaggac gagcttggca aggtcatgat gggcgtggtc 16080 

cgcccgaggg cagagccatg acttttttag ccgctaaaac ggccgggggg tgcgcgtgat 16140 

tgccaagcac gtccccatgc gctccatcaa gaagagcgac ttcgcggagc tggtgaagta 16200 

catcaccgac gagcaaggca agaccgagcg cctttgcgac gctca 16245 
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<211> 17877 
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ccgggctggt tgccctcgcc gctgggctgg cggccgtcta tggccctgca aacgcgccag 
aaacgccgtc gaagccgtgt gcgagacacc gcggccgccg gcgttgtgga tacctcgcgg 
aaaacttggc cctcactgac agatgagggg cggacgttga cacttgaggg gccgactcac 
ccggcgcggc gttgacagat gaggggcagg ctcgatttcg gccggcgacg tggagctggc 
cagcctcgca aatcggcgaa aacgcctgat tttacgcgag tttcccacag atgatgtgga 
caagcctggg gataagtgcc ctgcggtatt gacacttgag gggcgcgact actgacagat 
gaggggcgcg atccttgaca cttgaggggc agagtgctga cagatgaggg gcgcacctat 
tgacatttga ggggctgtcc acaggcagaa aatccagcat ttgcaagggt ttccgcccgt 
ttttcggcca ccgctaacct gtcttttaac ctgcttttaa accaatattt ataaaccttg 
tttttaacca gggctgcgcc ctgtgcgcgt gaccgcgcac gccgaagggg ggtgcccccc 
cttctcgaac cctcccggcc cgctaacgcg ggcctcccat ccccccaggg gctgcgcccc 
tcggccgcga acggcctcac cccaaaaatg gcagcgctgg cagtccttgc cattgccggg 
atcggggcag taacgggatrg ggcgatcagc ccgagcgcga cgcccggaag cattgacgtg 
ccgcaggtgc tggcatcgac attcagcgac caggtgccgg gcagtgaggg cggcggcctg 
ggtggcggcc tgcccttcac ttcggccgtc ggggcattca cggacttcat ggcggggccg 
gcaattttta ccttgggcat tcttggcata gtggtcgcgg gtgccgtgct cgtgttcggg 
ggtgcgataa acccagcgaa ccatttgagg tgataggtaa gattataccg aggtatgaaa 
acgagaattg gacctttaca gaattactct atgaagcgcc atatttaaaa agctaccaag 
acgaagagga tgaagaggat gaggaggcag attgccttga atatattgac aatactgata 
agataatata tcttttatat agaagatatc gccgtatgta aggatttcag ggggcaaggc 
ataggcagcg cgcttatcaa tatatctata gaatgggcaa agcataaaaa cttgcatgga 
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ctaatgcttg aaacccagga caataacctt 
atgactccaa cttattgata gtgttttatg 
agctccaccg attttgagaa cgacagcgac 
agattcaggt tatgccgctc aattcgctgc 
cccttcaggc gggattcata cagcggccag 
ggtgacagca ggctcataag acgccccagc 
gcaacaaccg tcttccggag actgtcatac 
gccccgacat agccccactg ttcgtccatt 
tgtatgcgcg aggttaccga ctgcggcctg. 
ggccaacgcc cataatgcgg gctgttgccc 
tgattttctg gtgcgtaccg ggttgagaag 
tacggcagtg agagcagaga tagcgctgat 
ccccgtcagt agctgaacag gagggacagc 

aaaaacacca tcatacacta aatcagtaag 
aaatcggctc cgtcgatact atgttatacg 
ttttctggta tttaaggttt tagaatgcaa 
aattagcttc ttggggtatc tttaaatact 
taaaatgaga atatcaccgg aattgaaaaa 
tacggaagga atgtctcctg ctaaggtata 
tttaaaaatg acggacagcc ggtataaagg 
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atagcttgta aattctatca taattgggta 1320 

ttcagataat gcccgatgac tttgtcatgc 1380 

ttccgtccca gccgtgccag gtgctgcctc 1440 

gtatatcgct tgctgattac gtgcagcttt 1500 

ccatccgtca tccatatcac cacgtcaaag 1560 

gtcgccatag tgcgttcacc gaatacgtgc 1620 

gcgtaaaaca gccagcgctg gcgcgattta 1680 

tccgcgcaga cgatgacgtc actgcccggc 1740 

agttttttaa gtgacgtaaa atcgtgttga 1800 

ggcatccaac gccattcatg gccatatcaa 18 60 

cggtgtaagt gaactgcagt tgccatgttt 1920 

gtccggcggt gcttttgccg ttacgcacca 1980 

tgatagacac agaagccact ggagcacctc 2040 

ttggcagcat cacccataat tgtggtttca 2100 

ccaactttga aaacaacttt gaaaaagctg 2160 

ggaacagtga attggagttc gtcttgttat 2220 

gtagaaaaga ggaaggaaat aataaatggc 2280 

actgatcgaa aaataccgct gcgtaaaaga 2340 

taagctggtg ggagaaaatg aaaacctata 2400 

gaccacctat gatgtggaac gggaaaagga 24 60 
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catgatgcta tggctggaag gaaagctgcc tgttccaaag gtcctgcact ttgaacggca 2520 

tgatggctgg agcaatctgc tcatgagtga ggccgatggc gtcctttgct cggaagagta 2580 

tgaagatgaa caaagccctg aaaagattat cgagctgtat gcggagtgca tcaggctctt 2640 

tcactccatc gacatatcgg attgtcccta tacgaatagc ttagacagcc gcttagccga 2700 

attggattac ttactgaata acgatctggc cgatgtggat tgcgaaaact gggaagaaga 2760 

cactccattt aaagatccgc gcgagctgta tgatttttta aagacggaaa agcccgaaga 2820 

ggaacttgtc ttttcccacg gcgacctggg agacagcaac atctttgtga aagatggcaa 2880 

agtaagtggc tttattgatc ttgggagaag cggcagggcg gacaagtggt atgacattgc 2940 

cttctgcgtc cggtcgatca gggaggatat cggggaagaa cagtatgtcg agctattttt 3000 

tgacttactg gggatcaagc ctgattggga gaaaataaaa tattatattt tactggatga 3060 

attgttttag tacctagatg tggcgcaacg atgccggcga caagcaggag cgcaccgact 3120 

tcttccgcat caagtgtttt ggctctcagg ccgaggccca cggcaagtat ttgggcaagg 3180 

ggtcgctggt attcgtgcag ggcaagattc ggaataccaa gtacgagaag gacggccaga 3240 

cggtctacgg gaccgacttc attgccgata aggtggatta tctggacacc aaggcaccag 3300 

gcgggtcaaa tcaggaataa gggcacattg ccccggcgtg agtcggggca atcccgcaag 3360 

gagggtgaat gaatcggacg tttgaccgga aggcatacag gcaagaactg atcgacgcgg 3420 

ggttttccgc cgaggatgcc gaaaccatcg caagccgcac cgtcatgcgt gcgccccgcg 3480 

aaaccttcca gtccgtcggc tcgatggtcc agcaagctac ggccaagatc gagcgcgaca 3540 

gcgtgcaact ggctccccct gccctgcccg cgccatcggc cgccgtggag cgttcgcgtc 3600 

gtctcgaaca ggaggcggca ggtttggcga agtcgatgac catcgacacg cgaggaacta 3660 
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tgacgaccaa gaagcgaaaa accgccggcg aggacctggc aaaacaggtc agcgaggcca 3720 

agcaggccgc gttgctgaaa cacacgaagc agcagatcaa ggaaatgcag ctttccttgt 3780 

tcgatattgc gccgtggccg gacacgatgc gagcgatgcc aaacgacacg gcccgctctg 3840 

ccctgttcac cacgcgcaac aagaaaatcc cgcgcgaggc gctgcaaaac aaggtcattt 3900 

tccacgtcaa caaggacgtg aagatcacct acaccggcgt cgagctgcgg gccgacgatg 3960 

acgaactggt gtggcagcag gtgttggagt acgcgaagcg cacccctatc ggcgagccga 4020 

tcaccttcac gttctacgag ctttgccagg acctgggctg gtcgatcaat ggccggtatt 4080 

acacgaaggc cgaggaatgc ctgtcgcgcc tacaggcgac ggcgatgggc ttcacgtccg 4140 

accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc ctggaccgtg 4200 

gcaagaaaac gtcccgttgc caggtcctga tcgacgagga aatcgtcgtg ctgtttgctg 4260 

gcgaccacta cacgaaattc atatgggaga agtaccgcaa gctgtcgccg acggcccgac 4320 

ggatgttcga ctatttcagc tcgcaccggg agccgtaccc gctcaagctg gaaaccttcc 4380 

gcctcatgtg cggatcggat tccacccgcg tgaagaagtg gcgcgagcag gtcggcgaag 4440 

cctgcgaaga gttgcgaggc agcggcctgg tggaacacgc ctgggtcaat gatgacctgg 4500 

tgcattgcaa acgctagggc cttgtggggt cagttccggc tgggggttca gcagccagcg 4560 

ctttactggc atttcaggaa caagcgggca ctgctcgacg cacttgcttc gctcagtatc 4620 

gctcgggacg cacggcgcgc tctacgaact gccgataaac agaggattaa aattgacaat 4 680 

tgtgattaag gctcagattc gacggcttgg agcggccgac gtgcaggatt tccgcgagat 4740 

ccgattgtcg gccctgaaga aagctccaga gatgttcggg tccgtttacg agcacgagga 4800 

gaaaaagccc atggaggcgt tcgctgaacg gttgcgagat gccgtggcat tcggcgccta 4860 

catcgacggc gagatcattg ggctgtcggt cttcaaacag gaggacggcc ccaaggacgc 4 920 
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tcacaaggcg catctgtccg gcgttttcgt ggagcccgaa cagcgaggcc gaggggtcgc 4 980 

cggtatgctg ctgcgggcgt tgccggcggg tttattgctc gtgatgatcg" tccgacagat 5040 

tccaacggga atctggtgga tgcgcatctt catcctcggc gcacttaata tttcgctatt 5100 

ctggagcttg ttgtttattt cggtctaccg cctgccgggc ggggtcgcgg cgacggtagg 5160 

cgctgtgcag ccgctgatgg tcgtgttcat ctctgccgct ctgctaggta gcccgatacg 5220 

attgatggcg gtcctggggg ctatttgcgg aactgcgggc gtggcgctgt tggtgttgac 5280 

accaaacgca gcgctagatc ctgtcggcgt cgcagcgggc ctggcggggg cggtttccat 5340 

ggcgttcgga accgtgctga cccgcaagtg gcaacctccc gtgcctctgc tcacctttac 5400 

cgcctggcaa ctggcggccg gaggacttct gctcgttcca gtagctttag tgtttgatcc 54 60 

gccaatcccg atgcctacag gaaccaatgt tctcggcctg gcgtggctcg gcctgatcgg 5520 

agcgggttta acctacttcc tttggttccg ggggatctcg cgactcgaac ctacagttgt 5580 

ttccttactg ggctttctca gccccagatc tggggtcgat cagccgggga tgcatcaggc 5640 

cgacagtcgg aacttcgggt ccccgacctg taccattcgg tgagcaatgg ataggggagt 5700 

tgatatcgtc aacgttcact tctaaagaaa tagcgccact cagcttcctc agcggcttta 5760 

tccagcgatt tcctattatg tcggcatagt tctcaagatc gacagcctgt cacggttaag 5820 

cgagaaatga ataagaaggc tgataattcg gatctctgcg agggagatga tatttgatca 5880 

caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga gatcatccgt 5940 

gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac atgagcaaag 6000 

tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg ctgcctgtat 6060 

cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct ggtggcagga 6120 
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tatattgtgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg cggacgtttt 6180 

taatgtactg gggtggtttt tcttttcacc agtgagacgg gcaacagctg attgcccttc 6240 

accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtttgccc cagcaggcga 6300 

aaatcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca aaagaatagc 6360 

ccgagatagg gttgagtgtt gttccagttt ggaacaagag tccactatta aagaacgtgg 6420 

actccaacgt caaagggcga aaaaccgtct atcagggcga tggcccacta cgtgaaccat 6480 

cacccaaatc aagttttttg gggtcgaggt gccgtaaagc actaaatcgg aaccctaaag 6540 

ggagcccccg atttagagct tgacggggaa agccggcgaa cgtggcgaga aaggaaggga 6600 

agaaagcgaa aggagcgggc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg 6660 

cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt 6720 

tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattcgag 6780 

ctcggtaccc ggggatcttt cgacactgaa atacgtcgag cctgctccgc ttggaagcgg 6840 

cgaggagcct cgtcctgtca caactaccaa catggagtac gataagggcc agttccgcca 6900 

gctcattaag agccagttca tgggcgttgg catgatggcc gtcatgcatc tgtacttcaa 6960 

gtacaccaac gctcttctga tccagtcgat catccgctga aggcgctttc gaatctggtt 7020 

aagatccacg tcttcgggaa gccagcgact ggtgacctcc agcgtccctt taaggctgcc 7080 

aacagctttc tcagccaggg ccagcccaag accgacaagg cctccctcca gaacgccgag 7140 

aagaactgga ggggtggtgt caaggaggag taagctcctt attgaagtcg gaggacggag 7200 

cggtgtcaag aggatattct tcgactctgt attatagata agatgatgag gaattggagg 7260 

tagcatagct tcatttggat ttgctttcca ggctgagact ctagcttgga gcatagaggg 7320 
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tcctttggct ttcaatattc tcaagtatct cgagtttgaa cttattccct gtgaaccttt 7380 

tattcaccaa tgagcattgg aatgaacatg aatctgagga ctgcaatcgc catgaggttt 7440 

tcgaaataca tccggatgtc gaaggcttgg ggcacctgcg ttggttgaat ttagaacgtg 7500 

gcactattga tcatccgata gctctgcaaa gggcgttgca caatgcaagt caaacgttgc 7560 

tagcagttcc aggtggaatg ttatgatgag cattgtatta aatcaggaga tatagcatga 7620 

tctctagtta gctcaccaca aaagtcagac ggcgtaacca aaagtcacac aacacaagct 7 680 

gtaaggattt cggcacggct acggaagacg gagaagccac cttcagtgga ctcgagtacc 7740 

atttaattct atttgtgttt gatcgagacc taatacagcc cctacaacga ccatcaaagt 7800 

cgtatagcta ccagtgagga agtggactca aatcgacttc agcaacatct cctggataaa 7860 

ctttaagcct aaactataca gaataagata ggtggagagc ttataccgag ctcccaaatc 7920 

tgtccagatc atggttgacc ggtgcctgga tcttcctata gaatcatcct tattcgttga 7980 

cctagctgat tctggagtga cccagagggt catgacttga gcctaaaatc cgccgcctcc 8040 

accatttgta gaaaaatgtg acgaactcgt gagctctgta cagtgaccgg tgactctttc 8100 

tggcatgcgg agagacggac ggacgcagag agaagggctg agtaataagc cactggccag 8160 

acagctctgg cggctctgag gtgcagtgga tgattattaa tccgggaccg gccgcccctc 8220 

cgccccgaag tggaaaggct ggtgtgcccc tcgttgacca agaatctatt gcatcatcgg 8280 

agaatatgga gcttcatcga atcaccggca gtaagcgaag gagaatgtga agccaggggt 8340 

gtatagccgt cggcgaaata gcatgccatt aacctaggta cagaagtcca attgcttccg 8400 

atctggtaaa agattcacga gatagtacct tctccgaagt aggtagagcg agtacccggc 84 60 

gcgtaagctc cctaattggc ccatccggca tctgtagggc gtccaaatat cgtgcctctc 8520 

ctgctttgcc cggtgtatga aaccggaaag gccgctcagg agctggccag cggcgcagac 8580 
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cgggaacaca agctggcagt cgacccatcc ggtgctctgc actcgacctg ctgaggtccc 8640 

tcagtccctg gtaggcagct ttgccccgtc tgtccgcccg gtgtgtcggc ggggttgaca 8700 

aggtcgttgc gtcagtccaa catttgttgc catattttcc tgctctcccc accagctgct 8760 

cttttctttt ctctttcttt tcccatcttc agtatattca tcttcccatc caagaacctt 8820 

tatttcccct aagtaagtac tttgctacat ccatactcca tccttcccat cccttattcc 8880 

tttgaacctt tcagttcgag ctttcccact tcatcgcagc ttgactaaca gctaccccgc 8940 

ttgagcagac atcaccatgc ctgaactcac cgcgacgtct gtcgagaagt ttctgatcga 9000 

aaagttcgac agcgtctccg acctgatgca gctctcggag ggcgaagaat ctcgtgcttt 9060 

cagcttcgat gtaggagggc gtggatatgt cctgcgggta aatagctgcg ccgatggttt 9120 

ctacaaagat cgttatgttt atcggcactt tgcatcggcc gcgctcccga ttccggaagt . 9180 

gcttgacatt ggggaattca gcgagagcct gacctattgc atctcccgcc gtgcacaggg 9240 

tgtcacgttg caagacctgc ctgaaaccga actgcccgct gttctgcagc cggtcgcgga 9300 

ggccatggat gcgatcgctg cggccgatct tagccagacg agcgggttcg gcccattcgg 9360 

accgcaagga atcggtcaat acactacatg gcgtgatttc atatgcgcga ttgctgatcc 9420 

ccatgtgtat cactggcaaa ctgtgatgga cgacaccgtc agtgcgtccg tcgcgcaggc 9480 

tctcgatgag ctgatgcttt gggccgagga ctgccccgaa gtccggcacc tcgtgcacgc 9540 

ggatttcggc tccaacaatg tcctgacgga caatggccgc ataacagcgg tcattgactg 9600 

gagcgaggcg atgttcgggg attcccaata cgaggtcgcc aacatcttct tctggaggcc 9660 

gtggttggct tgtatggagc agcagacgcg ctacttcgag- cggaggcatc cggagcttgc 9720 

aggatcgccg cggctccggg cgtatatgct ccgcattggt cttgaccaac tctatcagag 9780 
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cttggttgac ggcaatttcg atgatgcagc ttgggcgcag ggtcgatgcg acgcaatcgt 9840 

ccgatccgga gccgggactg tcgggcgtac acaaatcgcc cgcagaagcg cggccgtctg 9900 

gaccgatggc tgtgtagaag tactcgccga tagtggaaac cgacgcccca gcactcgtcc 9960 

gagggcaaag gaatagagta gatgccgacc gcgggatcga tccacttaac gttactgaaa 10020 

tcatcaaaca gcttgacgaa tctggatata agatcgttgg tgtcgatgtc agctccggag 10080 

ttgagacaaa tggtgttcag gatctcgata agatacgttc atttgtccaa gcagcaaaga 10140 

gtgccttcta gtgatttaat agctccatgt caacaagaat aaaacgcgtt ttcgggttta 10200 

cctcttccag atacagctca tctgcaatgc attaatgcat tgactgcaac ctagtaacgc 10260 

cttncaggct ccggcgaaga gaagaatagc ttagcagagc tattttcatt ttcgggagac 10320 

gagatcaagc agatcaacgg tcgtcaagag acctacgaga ctgaggaatc cgctcttggc 10380 

tccacgcgac tatatatttg tctctaattg tactttgaca tgctcctctt ctttactctg 10440 

atagcttgac tatgaaaatt ccgtcaccag cncctgggtt cgcaaagata attgcatgtt 10500 

tcttccttga actctcaagc ctacaggaca cacattcatc gtaggtataa acctcgaaat 10560 

canttcctac taagatggta tacaatagta accatgcatg gttgcctagt gaatgctccg 10620 

taacacccaa tacgccggcc gaaacttttt tacaactctc ctatgagtcg tttacccaga 10680 

atgcacaggt acacttgttt agaggtaatc cttctttcta gctagaagtc ctcgtgtact 10740 

gtgtaagcgc ccactccaca tctccactcg acctgcaggc atgcaagctt ttttcgagtt 10800 

tttttttttt ttctttgtga aggatttatt gttattggta tccatttttt attggaagac 10860 

aagataagtt aatattgatt ttgcttaaag attaaaagga aatcagaaaa cgacaataaa 10920 

aaatgtaacg gacaaactat ggtgtcgatt ataagtctaa atccttaaaa aatgacaacg 10980 
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agttgctttc ctctgaaaac aattcttttg tctttgcaag aaaggtttct tttttgtttg 11040 

cttgcattac ttaaacatca aatcaaatga aaggaataaa gcagatttga gggcgaataa 11100 

ggattttctg gtcaacaaga tgtgagtgac acctaaggaa ctaaatgcca ttcatttgtt 11160 

ttaaaacgac atcaaagatt gatgatcaac aggattgaga gagagaaaaa gaactcgtgt 11220 

catttatttc tgttgactga aattttatat ttagaaaaaa tgtcaaatct atagctttag 11280 

ctatattaca taacatttga aataataata ataaaaaaag acacattaga gacacttttc 11340 

aaactctaaa taactgtcta taaacacaaa gaaaacaaag acctctataa caacttatta 11400 

gatttttctc gtacttttgt ctaaagatga tgtattcttg ttatcccaca cttctttcat 11460 

ttgttcttga tgctactaaa tatacaaaat ttcttttttg caagagatat tattccaaaa 11520 

attttcaaaa agaaattttt ttcacaatag cagttgatcg tgtaacccaa agaggttctt 11580 

tgttattttg cacttccgct ttgcggtgat gcatattcaa agtaatatat ggaataaaca 11640 

acgtgtttaa gcatgaaaga aaggaaacaa aggccgcttt gaacaaatgc ataatatttc 11700 

agacaaaaat gatctaaagc aagcagtaaa tcaaacaaga aacattgctg attcgcgtta 11760 

gaaaacgata aaagtctaat aagccactaa gtatacttca atgaactttt tgtatgctta 11820 

tggtccaatc agaccaataa tttgtgacca ttcctgaggt ggctttggtg atgcggaaac 11880 

agaaaaaaat tttctcacca atcgatttaa aaaacaattt ctgctttgaa ccaaaacttt 11940 

ttttttctct ttaatcatta actttatcaa gtatgtacct accctcaaag tcctcactca 12000 

agcacaatta tgctaacatt gttccacctt ctctttagaa atgctgtcga agctgcagtc 12060 

aatcagcgtc aaggcccgcc gcgttgaact agcccgcgac atcacgcggc ccaaagtctg 12120 

cctgcatgct cagcggtgct cgttagttcg gctgcgagtg gcagcaccac agacagagga 12180 

ggcgctggga accgtgcagg ctgccggcgc gggcgatgag cacagcgccg atgtagcact 12240 
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ccagcagctt gaccgggcta tcgcagagcg tcgtgcccgg cgcaaacggg agcagctgtc 12300 

ataccaggct gccgccattg cagcatcaat tggcgtgtca ggcattgcca tcttcgccac 12360 

ctacctgaga tttgccatgc acatgaccgt gggcggcgca gtgccatggg gtgaagtggc 12420 

tggcactctc ctcttggtgg ttggtggcgc gctcggcatg gagatgtatg cccgctatgc 12480 

acacaaagcc atctggcatg agtcgcctct gggctggctg ctgcacaaga gccaccacac 12540 

acctcgcact ggaccctttg aagccaacga cttgtttgca atcatcaatg gactgcccgc 12600 

catgctcctg tgtacctttg gcttctggct gcccaacgtc ctgggggcgg cctgctttgg 12660 

agcggggctg ggcatcacgc tatacggcat ggcatatatg tttgtacacg atggcctggt 12720 

gcacaggcgc tttcccaccg ggcccatcgc tggcctgccc tacatgaagc gcctgacagt 12780 

ggcccaccag ctacaccaca gcggcaagta cggtggcgcg ccctggggta tgttcttggg 12840 

tccacaggag ctgcagcaca ttccaggtgc ggcggaggag gtggagcgac tggtcctgga 12900 

actggactgg tccaagcggt agaagcttgg cgtaatcatg gtcatagctg tttcctgtgt 12960 

gaaattgtta tccgctcaca attccacaca acatacgagc cggaagcata aagtgtaaag 13020 

cctggggtgc ctaatgagtg agctaactca cattaattgc gttgcgctca ctgcccgctt 13080 

tccagtcggg aaacctgtcg tgccagctgc attaatgaat cggccaacgc gcggggagag 13140 

gcggtttgcg tattgggcca aagacaaaag ggcgacattc aaccgattga gggagggaag 13200 

gtaaatattg acggaaatta ttcattaaag gtgaattatc accgtcaccg acttgagcca 13260 

tttgggaatt agagccagca aaatcaccag tagcaccatt accattagca aggccggaaa 13320 

cgtcaccaat gaaaccatcg atagcagcac cgtaatcagt agcgacagaa tcaagtttgc 13380 

ctttagcgtc agactgtagc gcgttttcat cggcattttc ggtcatagcc cccttattag 13440 
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cgtttgccat cttttcataa tcaaaatcac cggaaccaga gccaccaccg gaaccgcctc 13500 

cctcagagcc gccaccctca gaaccgccac cctcagagcc accaccctca gagccgccac 13560 

cagaaccacc accagagccg ccgccagcat tgacaggagg cccgatctag taacatagat 13620 

gacaccgcgc gcgataattt atcctagttt gcgcgctata ttttgttttc tatcgcgtat 13680 

taaatgtata attgcgggac tctaatcata aaaacccatc tcataaataa cgtcatgcat 13740 

tacatgttaa ttattacatg cttaacgtaa ttcaacagaa attatatgat aatcatcgca 13800 

agaccggcaa caggattcaa tcttaagaaa ctttattgcc aaatgtttga acgatcgggg 13860 

atcatccggg tctgtggcgg gaactccacg aaaatatccg aacgcagcaa gatatcgcgg 13920 

tgcatctcgg tcttgcctgg gcagtcgccg ccgacgccgt tgatgtggac gccgggcccg 13980 

atcatattgt cgctcaggat cgtggcgttg tgcttgtcgg ccgttgctgt cgtaatgata 14040 

tcggcacctt cgaccgcctg ttccgcagag atcccgtggg cgaagaactc cagcatgaga 14100 

tccccgcgct ggaggatcat ccagccggcg tcccggaaaa cgattccgaa gcccaacctt 14160 

tcatagaagg cggcggtgga atcgaaatct cgtgatggca ggttgggcgt cgcttggtcg 14220 

gtcatttcga accccagagt cccgctcaga agaactcgtc aagaaggcga tagaaggcga 14280 

tgcgctgcga atcgggagcg gcgataccgt aaagcacgag gaagcggtca gcccattcgc 14340 

cgccaagctc ttcagcaata tcacgggtag ccaacgctat gtcctgatag cggtccgcca 14400 

cacccagccg gccacagtcg atgaatccag aaaagcggcc attttccacc atgatattcg 14460 

gcaagcaggc atcgccatgg gtcacgacga gatcatcgcc gtcgggcatg cgcgccttga 14520 

gcctggcgaa cagttcggct ggcgcgagcc cctgatgctc ttcgtccaga tcatcctgat 14580 

cgacaagacc ggcttccatc cgagtacgtg ctcgctcgat gcgatgtttc gcttggtggt 14640 
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cgaatgggca ggtagccgga tcaagcgtat gcagccgccg cattgcatca gccatgatgg 

atactttctc ggcaggagca aggtgagatg acaggagatc ctgccccggc acttcgccca 

atagcagcca gtcccttccc gcttcagtga caacgtcgag cacagctgcg caaggaacgc 

ccgtcgtggc cagccacgat agccgcgctg cctcgtcctg cagttcattc agggcaccgg 

acaggtcggt cttgacaaaa agaaccgggc gcccctgcgc tgacagccgg aacacggcgg 

catcagagca gccgattgtc tgttgtgccc agtcatagcc gaatagcctc tccacccaag 

cggccggaga acctgcgtgc aatccatctt gttcaatcat gcgaaacgat ccagatccgg 

tgcagattat ttggattgag agtgaatatg agactctaat tggataccga ggggaattta 

tggaacgtca gtggagcatt tttgacaaga aatatttgct agctgatagt gaccttaggc 

gacttttgaa cgcgcaataa tggtttctga cgtatgtgct tagctcatta aactccagaa 

acccgcggct gagtggctcc ttcaacgttg cggttctgtc agttccaaac gtaaaacggc 

ttgtcccgcg tcatcggcgg gggtcataac gtgactccct taattctccg ctcatgatca 

gattgtcgtt tcccgcctt-c agtttaaact atcagtgttt gacaggatat attggcgggt 

aaacctaaga gaaaagagcg tttattagaa taatcggata tttaaaaggg cgtgaaaagg 

tttatccgtt cgtccatttg tatgtgcatg ccaaccacag ggttccccag atctggcgcc 

ggccagcgag acgagcaaga ttggccgccg cccgaaacga tccgacagcg cgcccagcac 

aggtgcgcag gcaaattgca ccaacgcata cagcgccagc agaatgccat agtgggcggt 

gacgtcgttc gagtgaacca gatcgcgcag gaggcccggc agcaccggca taatcaggcc 

gatgccgaca gcgtcgagcg cgacagtgct cagaattacg atcaggggta tgttgggttt 

cacgtctggc ctccggacca gcctccgctg gtccgattga acgcgcggat tctttatcac 

tgataagttg gtggacatat tatgtttatc agtgataaag tgtcaagcat gacaaagttg 
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cagccgaata cagtgatccg tgccgccctg gacctgttga acgaggtcgg cgtagacggt 15960 

ctgacgacac gcaaactggc ggaacggttg ggggttcagc agccggcgct ttactggcac 16020 

ttcaggaaca agcgggcgct gctcgacgca ctggccgaag ccatgctggc ggagaatcat 16080 

acgcattcgg tgccgagagc cgacgacgac tggcgctcat ttctgatcgg gaatgcccgc 16140 

agcttcaggc aggcgctgct cgcctaccgc gatggcgcgc gcatccatgc cggcacgcga 16200 

ccgggcgcac cgcagatgga aacggccgac gcgcagcttc gcttcctctg cgaggcgggt 16260 

ttttcggccg gggacgccgt caatgcgctg atgacaatca gctacttcac tgttggggcc 16320 

gtgcttgagg agcaggccgg cgacagcgat gccggcgagc gcggcggcac cgttgaacag 16380 

gctccgctct cgccgctgtt gcgggccgcg atagacgcct tcgacgaagc cggtccggac 16440 

gcagcgttcg agcagggact cgcggtgatt gtcgatggat tggcgaaaag gaggctcgtt 16500 

gtcaggaacg ttgaaggacc gagaaagggt gacgattgat caggaccgct gccggagcgc 16560 

aacccactca ctacagcaga gccatgtaga caacatcccc tccccctttc caccgcgtca 16620 

gacgcccgta gcagcccgct acgggctttt tcatgccctg ccctagcgtc caagcctcac 16680 

ggccgcgctc ggcctctctg gcggccttct ggcgctcttc cgcttcctcg ctcactgact 16740 

cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac 16800 

ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa 16860 

aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg 16920 

acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa 16980 

gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc 17040 

ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgcttttc cgctgcataa 17100 
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ccctgcttcg gggtcattat agcgattttt tcggtatatc catccttttt cgcacgatat 17160 

acaggatttt gccaaagggt tcgtgtagac tttccttggt gtatccaacg gcgtcagccg 17220 

ggcaggatag gtgaagtagg cccacccgcg agcgggtgtt ccttcttcac tgtcccttat 17280 

tcgcacctgg cggtgctcaa cgggaatcct gctctgcgag gctggccggc taccgccggc 17340 

gtaacagatg agggcaagcg gatggctgat gaaaccaagc caaccaggaa gggcagccca 17400 

cctatcaagg tgtactgcct tccagacgaa cgaagagcga ttgaggaaaa ggcggcggcg 174 60 

gccggcatga gcctgtcggc ctacctgctg gccgtcggcc agggctacaa aatcacgggc 17520 

gtcgtggact atgagcacgt ccgcgagctg gcccgcatca atggcgacct gggccgcctg 17580 

ggcggcctgc tgaaactctg gctcaccgac gacccgcgca cggcgcggtt cggtgatgcc 17640 

acgatcctcg ccctgctggc gaagatcgaa gagaagcagg acgagcttgg caaggtcatg 17700 

atgggcgtgg tccgcccgag ggcagagcca tgactttttt agccgctaaa acggccgggg 17760 

ggtgcgcgtg attgccaagc acgtccccat gcgctccatc aagaagagcg acttcgcgga 17820 

gctggtgaag tacatcaccg acgagcaagg caagaccgag cgcctttgcg acgctca 17877 



<210> 38 

<211> 17238 

<212> DNA 

<213> Artificial 

<220> 

<223> Plasmid 



<220> 

<221> misc_f eature 

<222> (10264) . . (10264) 

<223> n is a, c, g, or t 
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<220> 

<221> misc_f eature 

<222> (10472) . . (10472) 

<223> n is a, c, g, or t 

<220> 

<221> mi sc_f eature 

<222> (10563) . . (10563) 

<223> n is a, c, g, or t 

<400> 38 

ccgggctggt tgccctcgcc gctgggctgg cggccgtcta tggccctgca aacgcgccag 60 

aaacgccgtc gaagccgtgt gcgagacacc gcggccgccg gcgttgtgga tacctcgcgg 120 

aaaacttggc cctcactgac agatgagggg cggacgttga cacttgaggg gccgactcac 180 

ccggcgcggc gttgacagat gaggggcagg ctcgatttcg gccggcgacg tggagctggc 240 

cagcctcgca aatcggcgaa aacgcctgat tttacgcgag tttcccacag atgatgtgga 300 

caagcctggg gataagtgcc ctgcggtatt gacacttgag gggcgcgact actgacagat 360 

gaggggcgcg atccttgaca cttgaggggc agagtgctga cagatgaggg gcgcacctat 420 

tgacatttga ggggctgtcc acaggcagaa aatccagcat ttgcaagggt ttccgcccgt 480 

ttttcggcca ccgctaacct gtcttttaac ctgcttttaa accaatattt ataaaccttg 540 

tttttaacca gggctgcgcc ctgtgcgcgt gaccgcgcac gccgaagggg ggtgcccccc 600 

cttctcgaac cctcccggcc cgctaacgcg ggcctcccat ccccccaggg gctgcgcccc 660 

tcggccgcga acggcctcac cccaaaaatg gcagcgctgg cagtccttgc cattgccggg 720 

atcggggcag taacgggatg ggcgatcagc ccgagcgcga cgcccggaag cattgacgtg 780 

ccgcaggtgc tggcatcgac attcagcgac caggtgccgg gcagtgaggg cggcggcctg 840 

ggtggcggcc tgcccttcac ttcggccgtc ggggcattca cggacttcat ggcggggccg 900 
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gcaattttta ccttgggcat tcttggcata gtggtcgcgg gtgccgtgct cgtgttcggg 960 

ggtgcgataa acccagcgaa ccatttgagg tgataggtaa gattataccg aggtatgaaa 1020 

acgagaattg gacctttaca gaattactct atgaagcgcc atatttaaaa agctaccaag 1080 

acgaagagga tgaagaggat gaggaggcag attgccttga atatattgac aatactgata 1140 

agataatata tcttttatat agaagatatc gccgtatgta aggatttcag ggggcaaggc 1200 

ataggcagcg cgcttatcaa tatatctata gaatgggcaa agcataaaaa cttgcatgga 1260 

ctaatgcttg aaacccagga caataacctt atagcttgta aattctatca taattgggta 1320 

atgactccaa cttattgata gtgttttatg ttcagataat gcccgatgac tttgtcatgc 1380 

agctccaccg attttgagaa cgacagcgac ttccgtccca gccgtgccag gtgctgcctc 1440 

agattcaggt tatgccgctc aattcgctgc gtatatcgct tgctgattac gtgcagcttt 1500 

cccttcaggc gggattcata cagcggccag ccatccgtca tccatatcac cacgtcaaag 1560 

ggtgacagca ggctcataag acgccccagc gtcgccatag tgcgttcacc gaatacgtgc 1620 

gcaacaaccg tcttccggag actgtcatac gcgtaaaaca gccagcgctg gcgcgattta 1680 

gccccgacat agccccactg ttcgtccatt tccgcgcaga cgatgacgtc actgcccggc 1740 

tgtatgcgcg aggttaccga ctgcggcctg agttttttaa gtgacgtaaa atcgtgttga 1800 

ggccaacgcc cataatgcgg gctgttgccc ggcatccaac gccattcatg gccatatcaa 1860 

tgattttctg gtgcgtaccg ggttgagaag cggtgtaagt gaactgcagt tgccatgttt 1920 

tacggcagtg agagcagaga tagcgctgat gtccggcggt gcttttgccg ttacgcacca 1980 

ccccgtcagt agctgaacag gagggacagc tgatagacac agaagccact ggagcacctc 2040 

aaaaacacca tcatacacta aatcagtaag ttggcagcat cacccataat tgtggtttca 2100 
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aaatcggctc cgtcgatact atgttatacg ccaactttga aaacaacttt gaaaaagctg 2160 

ttttctggta tttaaggttt tagaatgcaa ggaacagtga attggagttc gtcttgttat 2220 

aattagcttc ttggggtatc tttaaatact gtagaaaaga ggaaggaaat aataaatggc 2280 

taaaatgaga atatcaccgg aattgaaaaa actgatcgaa aaataccgct gcgtaaaaga 2340 

tacggaagga atgtctcctg ctaaggtata taagctggtg ggagaaaatg aaaacctata 2400 

tttaaaaatg acggacagcc ggtataaagg gaccacctat gatgtggaac gggaaaagga 24 60 

catgatgcta tggctggaag gaaagctgcc tgttccaaag gtcctgcact ttgaacggca 2520 

tgatggctgg agcaatctgc tcatgagtga ggccgatggc gtcctttgct cggaagagta 2580 

tgaagatgaa caaagccctg aaaagattat cgagctgtat gcggagtgca tcaggctctt 2640 

tcactccatc gacatatcgg attgtcccta tacgaatagc ttagacagcc gcttagccga 2700 

attggattac ttactgaata acgatctggc cgatgtggat tgcgaaaact gggaagaaga 2760 

cactccattt aaagatccgc gcgagctgta tgatttttta aagacggaaa agcccgaaga 2820 

ggaacttgtc ttttcccacg gcgacctggg agacagcaac atctttgtga aagatggcaa 2880 

agtaagtggc tttattgatc ttgggagaag cggcagggcg gacaagtggt atgacattgc 2940 

cttctgcgtc cggtcgatca gggaggatat cggggaagaa cagtatgtcg agctattttt 3000 

tgacttactg gggatcaagc ctgattggga gaaaataaaa tattatattt tactggatga 3060 

attgttttag tacctagatg tggcgcaacg atgccggcga caagcaggag cgcaccgact 3120 

tcttccgcat caagtgtttt ggctctcagg ccgaggccca cggcaagtat ttgggcaagg 3180 

ggtcgctggt attcgtgcag ggcaagattc ggaataccaa gtacgagaag gacggccaga 3240 

cggtctacgg gaccgacttc attgccgata aggtggatta tctggacacc aaggcaccag 3300 



BASF AG 1 16/365 January 08, 2004 

BASF NAE 877/03 

gcgggtcaaa tcaggaataa gggcacattg ccccggcgtg agtcggggca atcccgcaag 3360 

gagggtgaat gaatcggacg tttgaccgga aggcatacag gcaagaactg atcgacgcgg 3420 

ggttttccgc cgaggatgcc gaaaccatcg caagccgcac cgtcatgcgt gcgccccgcg 34 80 

aaaccttcca gtccgtcggc tcgatggtcc agcaagctac ggccaagatc gagcgcgaca 3540 

gcgtgcaact ggctccccct gccctgcccg cgccatcggc cgccgtggag cgttcgcgtc 3600 

gtctcgaaca ggaggcggca ggtttggcga agtcgatgac catcgacacg cgaggaacta 3660 

tgacgaccaa gaagcgaaaa accgccggcg aggacctggc aaaacaggtc agcgaggcca 3720 

agcaggccgc gttgctgaaa cacacgaagc agcagatcaa ggaaatgcag ctttccttgt 3780 

tcgatattgc gccgtggccg gacacgatgc gagcgatgcc aaacgacacg gcccgctctg 3840 

ccctgttcac cacgcgcaac aagaaaatcc cgcgcgaggc gctgcaaaac aaggtcattt 3900 

tccacgtcaa caaggacgtg aagatcacct acaccggcgt cgagctgcgg gccgacgatg 3960 

acgaactggt gtggcagcag gtgttggagt acgcgaagcg cacccctatc ggcgagccga 4020 

tcaccttcac gttctacga'g ctttgccagg acctgggctg gtcgatcaat ggccggtatt 4080 

acacgaaggc cgaggaatgc ctgtcgcgcc tacaggcgac ggcgatgggc ttcacgtccg 4140 

accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc ctggaccgtg 4200 

gcaagaaaac gtcccgttgc caggtcctga tcgacgagga aatcgtcgtg ctgtttgctg 4260 

gcgaccacta cacgaaattc atatgggaga agtaccgcaa gctgtcgccg acggcccgac 4320 

ggatgttcga ctatttcagc tcgcaccggg agccgtaccc gctcaagctg gaaaccttcc 4380 

gcctcatgtg cggatcggat tccacccgcg tgaagaagtg gcgcgagcag gtcggcgaag 4440 

cctgcgaaga gttgcgaggc agcggcctgg tggaacacgc ctgggtcaat gatgacctgg 4500 

tgcattgcaa acgctagggc cttgtggggt cagttccggc tgggggttca gcagccagcg 4560 
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ctttactggc atttcaggaa caagcgggca ctgctcgacg cacttgcttc gctcagtatc 4620 

gctcgggacg cacggcgcgc tctacgaact gccgataaac agaggattaa aattgacaat 4 680 

tgtgattaag gctcagattc gacggcttgg agcggccgac gtgcaggatt tccgcgagat 4740 

ccgattgtcg gccctgaaga aagctccaga gatgttcggg tccgtttacg agcacgagga 4800 

gaaaaagccc atggaggcgt tcgctgaacg gttgcgagat gccgtggcat tcggcgccta 4860 

catcgacggc gagatcattg ggctgtcggt cttcaaacag gaggacggcc ccaaggacgc 4 920 

tcacaaggcg catctgtccg gcgttttcgt ggagcccgaa cagcgaggcc gaggggtcgc 4 980 

cggtatgctg ctgcgggcgt tgccggcggg tttattgctc gtgatgatcg tccgacagat 5040 

tccaacggga atctggtgga tgcgcatctt catcctcggc gcacttaata tttcgctatt 5100 
ctggagcttg ttgtttattt cggtctaccg cctgccgggc ggggtcgcgg cgacggtagg • 5160 

cgctgtgcag ccgctgatgg tcgtgttcat ctctgccgct ctgctaggta gcccgatacg 5220 

attgatggcg gtcctggggg ctatttgcgg aactgcgggc gtggcgctgt tggtgttgac 5280 

accaaacgca gcgctagatc ctgtcggcgt cgcagcgggc ctggcggggg cggtttccat 5340 

ggcgttcgga accgtgctga cccgcaagtg gcaacctccc gtgcctctgc tcacctttac 5400 

cgcctggcaa ctggcggccg gaggacttct gctcgttcca gtagctttag tgtttgatcc 54 60 

gccaatcccg atgcctacag gaaccaatgt tctcggcctg gcgtggctcg gcctgatcgg 5520 

agcgggttta acctacttcc tttggttccg ggggatctcg cgactcgaac ctacagttgt 5580 

ttccttactg ggctttctca gccccagatc tggggtcgat cagccgggga tgcatcaggc 5640 

cgacagtcgg aacttcgggt ccccgacctg taccattcgg tgagcaatgg ataggggagt 5700 

tgatatcgtc aacgttcact tctaaagaaa tagcgccact cagcttcctc agcggcttta 5760 
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tccagcgatt tcctattatg tcggcatagt tctcaagatc gacagcctgt cacggttaag 5820 

cgagaaatga ataagaaggc tgataattcg gatctctgcg agggagatga tatttgatca 5880 

caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga gatcatccgt 5940 

gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac atgagcaaag 6000 

tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg ctgcctgtat 6060 

cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct ggtggcagga 6120 

tatattgtgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg cggacgtttt 6180 

taatgtactg gggtggtttt tcttttcacc agtgagacgg gcaacagctg attgcccttc 6240 

accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtttgccc cagcaggcga 6300 

aaatcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca aaagaatagc 6360 

ccgagatagg gttgagtgtt gttccagttt ggaacaagag tccactatta aagaacgtgg 6420 

actccaacgt caaagggcga aaaaccgtct atcagggcga tggcccacta cgtgaaccat 6480 

cacccaaatc aagttttttg gggtcgaggt gccgtaaagc actaaatcgg aaccctaaag 6540 

ggagcccccg atttagagct tgacggggaa agccggcgaa cgtggcgaga aaggaaggga 6600 

agaaagcgaa aggagcgggc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg 6660 

cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt 6720 

tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattcgag 6780 

ctcggtaccc ggggatcttt cgacactgaa atacgtcgag cctgctccgc ttggaagcgg 6840 

cgaggagcct cgtcctgtca caactaccaa catggagtac gataagggcc agttccgcca 6900 

gctcattaag agccagttca tgggcgttgg catgatggcc gtcatgcatc tgtacttcaa 6960 
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gtacaccaac gctcttctga tccagtcgat catccgctga aggcgctttc gaatctggtt 7020 

aagatccacg tcttcgggaa gccagcgact ggtgacctcc agcgtccctt taaggctgcc 7080 

aacagctttc tcagccaggg ccagcccaag accgacaagg cctccctcca gaacgccgag 7140 

aagaactgga ggggtggtgt caaggaggag taagctcctt attgaagtcg gaggacggag 7200 

cggtgtcaag aggatattct tcgactctgt attatagata agatgatgag gaattggagg 7260 

tagcatagct tcatttggat ttgctttcca ggctgagact ctagcttgga gcatagaggg 7320 

tcctttggct ttcaatattc tcaagtatct cgagtttgaa cttattccct gtgaaccttt 7380 

tattcaccaa tgagcattgg aatgaacatg aatctgagga ctgcaatcgc catgaggttt 7440 

tcgaaataca tccggatgtc gaaggcttgg ggcacctgcg ttggttgaat ttagaacgtg 7500 

gcactattga tcatccgata gctctgcaaa gggcgttgca caatgcaagt caaacgttgc 7560 

tagcagttcc aggtggaatg ttatgatgag cattgtatta aatcaggaga tatagcatga 7620 

tctctagtta gctcaccaca aaagtcagac ggcgtaacca aaagtcacac aacacaagct 7680 

gtaaggattt cggcacggct acggaagacg gagaagccac cttcagtgga ctcgagtacc 7740 

atttaattct atttgtgttt gatcgagacc taatacagcc cctacaacga ccatcaaagt 7800 

cgtatagcta ccagtgagga agtggactca aatcgacttc agcaacatct cctggataaa 7860 

ctttaagcct aaactataca gaataagata ggtggagagc ttataccgag ctcccaaatc 7920 

tgtccagatc atggttgacc ggtgcctgga tcttcctata gaatcatcct tattcgttga 7980 

cctagctgat tctggagtga cccagagggt catgacttga gcctaaaatc cgccgcctcc 8040 

accatttgta gaaaaatgtg acgaactcgt gagctctgta cagtgaccgg tgactctttc 8100 

tggcatgcgg agagacggac ggacgcagag agaagggctg agtaataagc cactggccag 8160 

acagctctgg cggctctgag gtgcagtgga tgattattaa tccgggaccg gccgcccctc 8220 
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cgccccgaag tggaaaggct ggtgtgcccc tcgttgacca agaatctatt gcatcatcgg 8280 

agaatatgga gcttcatcga atcaccggca gtaagcgaag gagaatgtga agccaggggt 8340 

gtatagccgt cggcgaaata gcatgccatt aacctaggta cagaagtcca attgcttccg 8400 

atctggtaaa agattcacga gatagtacct tctccgaagt aggtagagcg agtacccggc 84 60 

gcgtaagctc cctaattggc ccatccggca tctgtagggc gtccaaatat cgtgcctctc 8520 

ctgctttgcc cggtgtatga aaccggaaag gccgctcagg agctggccag cggcgcagac 8580 

cgggaacaca agctggcagt cgacccatcc ggtgctctgc actcgacctg ctgaggtccc 8640 

tcagtccctg gtaggcagct ttgccccgtc tgtccgcccg gtgtgtcggc ggggttgaca 8700 

aggtcgttgc gtcagtccaa catttgttgc catattttcc tgctctcccc accagctgct 8760 

cttttctttt ctctttcttt tcccatcttc agtatattca tcttcccatc caagaacctt 8820 

tatttcccct aagtaagtac tttgctacat ccatactcca tccttcccat cccttattcc 8880 

tttgaacctt tcagttcgag ctttcccact tcatcgcagc ttgactaaca gctaccccgc 8940 

ttgagcagac atcaccatgc ctgaactcac cgcgacgtct gtcgagaagt ttctgatcga 9000 

aaagttcgac agcgtctccg acctgatgca gctctcggag ggcgaagaat ctcgtgcttt 9060 

cagcttcgat gtaggagggc gtggatatgt cctgcgggta aatagctgcg ccgatggttt 9120 

ctacaaagat cgttatgttt atcggcactt tgcatcggcc gcgctcccga ttccggaagt 9180 

gcttgacatt ggggaattca gcgagagcct gacctattgc atctcccgcc gtgcacaggg 9240 

tgtcacgttg caagacctgc ctgaaaccga actgcccgct gttctgcagc cggtcgcgga 9300 

ggccatggat gcgatcgctg cggccgatct tagccagacg agcgggttcg gcccattcgg 9360 

accgcaagga atcggtcaat acactacatg gcgtgatttc atatgcgcga ttgctgatcc 9420 
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ccatgtgtat cactggcaaa ctgtgatgga cgacaccgtc agtgcgtccg tcgcgcaggc 9480 

tctcgatgag ctgatgcttt gggccgagga ctgccccgaa gtccggcacc tcgtgcacgc 9540 

ggatttcggc tccaacaatg tcctgacgga caatggccgc ataacagcgg tcattgactg 9600 

gagcgaggcg atgttcgggg attcccaata cgaggtcgcc aacatcttct tctggaggcc 9660 

gtggttggct tgtatggagc agcagacgcg ctacttcgag cggaggcatc cggagcttgc 9720 

aggatcgccg cggctccggg cgtatatgct ccgcattggt cttgaccaac tctatcagag 9780 

cttggttgac ggcaatttcg atgatgcagc ttgggcgcag ggtcgatgcg acgcaatcgt 9840 

ccgatccgga gccgggactg tcgggcgtac acaaatcgcc cgcagaagcg cggccgtctg 9900 

gaccgatggc tgtgtagaag tactcgccga tagtggaaac cgacgcccca gcactcgtcc 9960 

gagggcaaag gaatagagta gatgccgacc gcgggatcga tccacttaac gttactgaaa 10020 

tcatcaaaca gcttgacgaa tctggatata agatcgttgg tgtcgatgtc agctccggag 10080 

ttgagacaaa tggtgttcag gatctcgata agatacgttc atttgtccaa gcagcaaaga 10140 

gtgccttcta gtgatttaart agctccatgt. caacaagaat aaaacgcgtt ttcgggttta 10200 

cctcttccag atacagctca tctgcaatgc attaatgcat tgactgcaac ctagtaacgc 10260 

cttncaggct ccggcgaaga gaagaatagc ttagcagagc tattttcatt ttcgggagac 10320 

gagatcaagc agatcaacgg tcgtcaagag acctacgaga ctgaggaatc cgctcttggc 10380 

tccacgcgac tatatatttg tctctaattg tactttgaca tgctcctctt ctttactctg 10440 

atagcttgac tatgaaaatt ccgtcaccag cncctgggtt cgcaaagata attgcatgtt 10500 

tcttccttga actctcaagc ctacaggaca cacattcatc gtaggtataa acctcgaaat 10560 

canttcctac taagatggta tacaatagta accatgcatg gttgcctagt gaatgctccg 10620 

taacacccaa tacgccggcc gaaacttttt tacaactctc ctatgagtcg tttacccaga 10680 
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atgcacaggt acacttgttt agaggtaatc cttctttcta gctagaagtc ctcgtgtact 10740 

gtgtaagcgc ccactccaca tctccactcg acctgcaggc atgcaagctt ctaccgcttg 10800 

gaccagtcca gttccaggac cagtcgctcc acctcctccg ccgcacctgg aatgtgctgc 10860 

agctcctgtg gacccaagaa cataccccag ggcgcgccac cgtacttgcc gctgtggtgt 10920 

agctggtggg ccactgtcag gcgcttcatg tagggcaggc cagcgatggg cccggtggga 10980 

aagcgcctgt gcaccaggcc atcgtgtaca aacatatatg ccatgccgta tagcgtgatg 11040 

cccagccccg ctccaaagca ggccgccccc aggacgttgg gcagccagaa gccaaaggta 11100 

cacaggagca tggcgggcag tccattgatg attgcaaaca agtcgttggc ttcaaagggt 11160 

ccagtgcgag gtgtgtggtg gctcttgtgc agcagccagc ccagaggcga ctcatgccag 11220 

atggctttgt gtgcatagcg ggcatacatc tccatgccga gcgcgccacc aaccaccaag 11280 

aggagagtgc cagccacttc accccatggc actgcgccgc ccacggtcat gtgcatggca 11340 

aatctcaggt aggtggcgaa gatggcaatg cctgacacgc caattgatgc tgcaatggcg 11400 

gcagcctggt atgacagctg ctcccgtttg cgccgggcac gacgctctgc gatagcccgg 11460 

tcaagctgct ggagtgctac atcggcgctg tgctcatcgc ccgcgccggc agcctgcacg 11520 

gttcccagcg cctcctctgt ctgtggtgct gccactcgca gccgaactaa cgagcaccgc 11580 

tgagcatgca ggcagacttt gggccgcgtg atgtcgcggg ctagttcaac gcggcgggcc 11640 

ttgacgctga ttgactgcag cttcgacagc atagagataa aataaaaaga gaagaaaaga 11700 

aagtttgtac aatttctttt tgtttatata acatacacgc tatgtcaaca tttagaataa 11760 

gggggaaaaa atcttccatc atattcgaat gcacaagatt atttctttgt tcgctctttt 11820 

tggtcgggtc atcgagattt agagtgtaat caaagatact gtcatctcga gagcgttgca 11880 
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caggctgctg tttgccaaat tggatgtttg ccgaattagt aaaatacgca agcatttctt 11940 

acctttccgc tcccttttcc taattctccc aaagactaaa tgaggaaaga taaaggacaa 12000 

agaaaatgta aagacaaaga aattgaaaac gatataaact tgcagcacgt aagaccaaag 12060 

caaattggta actattcttg tgtacaaaca tgtataaaaa aaaacttttt tttgctcctg 12120 

gaggacaaaa tttcaaactc cttgaagaag attgcttgta tatctatcat atgcatatat 12180 

catatcgatg gaaaaagaaa gtcaggcatg tatttataaa aagaagaatg tgccatgctt 12240 

ccgaatttct tttcactttc ttttccttat ctattttaat ctcaagcttg gcgtaatcat 12300 

ggtcatagct gtttcctgtg tgaaattgtt atccgctcac aattccacac aacatacgag 12360 

ccggaagcat aaagtgtaaa gcctggggtg cctaatgagt gagctaactc acattaattg 12420 

cgttgcgctc actgcccgct ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa 12480 

tcggccaacg cgcggggaga ggcggtttgc gtattgggcc aaagacaaaa gggcgacatt 12540 

caaccgattg agggagggaa ggtaaatatt gacggaaatt attcattaaa ggtgaattat 12600 

caccgtcacc gacttgagcc atttgggaat tagagccagc aaaatcacca gtagcaccat 12660 

taccattagc aaggccggaa acgtcaccaa tgaaaccatc gatagcagca ccgtaatcag 12720 

tagcgacaga atcaagtttg cctttagcgt cagactgtag cgcgttttca tcggcatttt 12780 

cggtcatagc ccccttatta gcgtttgcca tcttttcata atcaaaatca ccggaaccag 12840 

agccaccacc ggaaccgcct ccctcagagc cgccaccctc agaaccgcca ccctcagagc 12900 

caccaccctc agagccgcca ccagaaccac caccagagcc gccgccagca ttgacaggag 12960 

gcccgatcta gtaacataga tgacaccgcg cgcgataatt tatcctagtt tgcgcgctat 13020 

attttgtttt ctatcgcgta ttaaatgtat aattgcggga ctctaatcat aaaaacccat 13080 

ctcataaata acgtcatgca ttacatgtta attattacat gcttaacgta attcaacaga 13140 
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aattatatga taatcatcgc aagaccggca acaggattca atcttaagaa actttattgc 13200 

caaatgtttg aacgatcggg gatcatccgg gtctgtggcg ggaactccac gaaaatatcc 13260 

gaacgcagca agatatcgcg gtgcatctcg gtcttgcctg ggcagtcgcc gccgacgccg 13320 

ttgatgtgga cgccgggccc gatcatattg tcgctcagga tcgtggcgtt gtgcttgtcg 13380 

gccgttgctg tcgtaatgat atcggcacct tcgaccgcct gttccgcaga gatcccgtgg 13440 

gcgaagaact ccagcatgag atccccgcgc tggaggatca tccagccggc gtcccggaaa 13500 

acgattccga agcccaacct ttcatagaag gcggcggtgg aatcgaaatc tcgtgatggc 13560 

aggttgggcg tcgcttggtc ggtcatttcg aaccccagag tcccgctcag aagaactcgt 13620 

caagaaggcg atagaaggcg atgcgctgcg aatcgggagc ggcgataccg taaagcacga 13680 

ggaagcggtc agcccattcg ccgccaagct cttcagcaat atcacgggta gccaacgcta 13740 

tgtcctgata gcggtccgcc acacccagcc ggccacagtc gatgaatcca gaaaagcggc 13800 

cattttccac catgatattc ggcaagcagg catcgccatg ggtcacgacg agatcatcgc 13860 

cgtcgggcat gcgcgccttg agcctggcga acagttcggc tggcgcgagc ccctgatgct 13920 

cttcgtccag atcatcctga tcgacaagac cggcttccat ccgagtacgt gctcgctcga 13980 

tgcgatgttt cgcttggtgg tcgaatgggc aggtagccgg atcaagcgta tgcagccgcc 14040 

gcattgcatc agccatgatg gatactttct cggcaggagc aaggtgagat gacaggagat 14100 

cctgccccgg cacttcgccc aatagcagcc agtcccttcc cgcttcagtg acaacgtcga 14160 

gcacagctgc gcaaggaacg cccgtcgtgg ccagccacga tagccgcgct gcctcgtcct 14220 

gcagttcatt cagggcaccg gacaggtcgg tcttgacaaa aagaaccggg cgcccctgcg 14280 

ctgacagccg gaacacggcg gcatcagagc agccgattgt ctgttgtgcc cagtcatagc 14340 
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cgaatagcct ctccacccaa gcggccggag aacctgcgtg caatccatct tgttcaatca 14400 

tgcgaaacga tccagatccg gtgcagatta tttggattga gagtgaatat gagactctaa 144 60 

ttggataccg aggggaattt atggaacgtc agtggagcat ttttgacaag aaatatttgc 14520 

tagctgatag tgaccttagg cgacttttga acgcgcaata atggtttctg acgtatgtgc 14580 

ttagctcatt aaactccaga aacccgcggc tgagtggctc cttcaacgtt gcggttctgt 14640 

cagttccaaa cgtaaaacgg cttgtcccgc gtcatcggcg ggggtcataa cgtgactccc 14700 

ttaattctcc gctcatgatc . agattgtcgt ttcccgcctt cagtttaaac tatcagtgtt 14760 

tgacaggata tattggcggg taaacctaag agaaaagagc gtttattaga ataatcggat 14820 

atttaaaagg gcgtgaaaag gtttatccgt tcgtccattt gtatgtgcat gccaaccaca 14880 

gggttcccca gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg 14 940 

atccgacagc gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag 15000 

cagaatgcca tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg 15060 

cagcaccggc ataatcagg*c cgatgccgac agcgtcgagc gcgacagtgc tcagaattac 15120 

gatcaggggt atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg 15180 

aacgcgcgga ttctttatca ctgataagtt ggtggacata ttatgtttat cagtgataaa 15240 

gtgtcaagca tgacaaagtt gcagccgaat acagtgatcc gtgccgccct ggacctgttg 15300 

aacgaggtcg gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag 15360 

cagccggcgc tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa 15420 

gccatgctgg cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca 15480 

tttctgatcg ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg 15540 

cgcatccatg ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt 15600 
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cgcttcctct gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc 15660 

agctacttca ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag 15720 

cgcggcggca ccgttgaaca ggctccgctc tcgccgctgt tgcgggccgc gatagacgcc 15780 

ttcgacgaag ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga 15840 

ttggcgaaaa ggaggctcgt tgtcaggaac gttgaaggac cgagaaaggg tgacgattga 15900 

tcaggaccgc tgccggagcg caacccactc actacagcag agccatgtag acaacatccc 15960 

ctcccccttt ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct 16020 

gccctagcgt ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt 16080 

ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag 16140 

ctcactcaaa ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca 16200 

tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt 16260 

tccataggct ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc 16320 

gaaacccgac aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct 16380 

ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg 16440 

tggcgctttt ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtatat 16500 

ccatcctttt tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg 16560 

tgtatccaac ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtgt 16620 

tccttcttca ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga 16680 

ggctggccgg ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag 16740 

ccaaccagga agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg 16800 
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attgaggaaa aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc 16860 

cagggctaca aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc 16920 

aatggcgacc tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc 16980 

acggcgcggt tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag 17040 

gacgagcttg gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt 17100 

tagccgctaa aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat 17160 

caagaagagc gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga 17220 

gcgcctttgc gacgctca 17238 



<210> 39 

<211> 17238 

<212> DNA 

<213> Artificial 

<220> 

<223> Plasmid 
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<222> (10264) . . (10264) 

<223> n is a, c, g, or t 

<220> 
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<400> 39 

ccgggctggt tgccctcgcc gctgggctgg cggccgtcta tggccctgca aacgcgccag 60 

aaacgccgtc gaagccgtgt gcgagacacc gcggccgccg gcgttgtgga . tacctcgcgg 120 

aaaacttggc cctcactgac agatgagggg cggacgttga cacttgaggg gccgactcac 180 

ccggcgcggc gttgacagat gaggggcagg ctcgatttcg gccggcgacg tggagctggc 240 

cagcctcgca aatcggcgaa aacgcctgat tttacgcgag tttcccacag atgatgtgga 300 

caagcctggg gataagtgcc ctgcggtatt gacacttgag gggcgcgact actgacagat 360 

gaggggcgcg atccttgaca cttgaggggc agagtgctga cagatgaggg gcgcacctat 420 

tgacatttga ggggctgtcc acaggcagaa aatccagcat ttgcaagggt ttccgcccgt 480 

ttttcggcca ccgctaacct gtcttttaac ctgcttttaa accaatattt ataaaccttg 540 

tttttaacca gggctgcgcc ctgtgcgcgt gaccgcgcac gccgaagggg ggtgcccccc 600 

cttctcgaac cctcccggcc cgctaacgcg ggcctcccat ccccccaggg gctgcgcccc 660 

tcggccgcga acggcctcac cccaaaaatg gcagcgctgg cagtccttgc cattgccggg 720 

atcggggcag taacgggatg ggcgatcagc ccgagcgcga cgcccggaag cattgacgtg 780 

ccgcaggtgc tggcatcgac attcagcgac caggtgccgg gcagtgaggg cggcggcctg 840 

ggtggcggcc tgcccttcac ttcggccgtc ggggcattca cggacttcat ggcggggccg 900 

gcaattttta ccttgggcat tcttggcata gtggtcgcgg gtgccgtgct cgtgttcggg 960 

ggtgcgataa acccagcgaa ccatttgagg tgataggtaa gattataccg aggtatgaaa 1020 

acgagaattg gacctttaca gaattactct atgaagcgcc atatttaaaa agctaccaag 1080 

acgaagagga tgaagaggat gaggaggcag attgccttga atatattgac aatactgata 1140 

agataatata tcttttatat agaagatatc gccgtatgta aggatttcag ggggcaaggc 1200 
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ataggcagcg cgcttatcaa tatatctata gaatgggcaa agcataaaaa cttgcatgga 1260 

ctaatgcttg aaacccagga caataacctt atagcttgta aattctatca taattgggta 1320 

atgactccaa cttattgata gtgttttatg ttcagataat gcccgatgac tttgtcatgc 1380 

agctccaccg attttgagaa cgacagcgac ttccgtccca gccgtgccag gtgctgcctc 1440 

agattcaggt tatgccgctc aattcgctgc gtatatcgct tgctgattac gtgcagcttt 1500 

cccttcaggc gggattcata cagcggccag ccatccgtca tccatatcac cacgtcaaag 1560 

ggtgacagca ggctcataag acgccccagc gtcgccatag tgcgttcacc gaatacgtgc 1620 

gcaacaaccg tcttccggag actgtcatac gcgtaaaaca gccagcgctg gcgcgattta 1680 

gccccgacat agccccactg ttcgtccatt tccgcgcaga cgatgacgtc actgcccggc 1740 

tgtatgcgcg aggttaccga ctgcggcctg agttttttaa gtgacgtaaa atcgtgttga 1800 

ggccaacgcc cataatgcgg gctgttgccc ggcatccaac gccattcatg gccatatcaa 1860 

tgattttctg gtgcgtaccg ggttgagaag cggtgtaagt gaactgcagt tgccatgttt 1920 

tacggcagtg agagcagaga tagcgctgat gtccggcggt gcttttgccg ttacgcacca 1980 

ccccgtcagt agctgaacag gagggacagc tgatagacac agaagccact ggagcacctc 2040 

aaaaacacca tcatacacta aatcagtaag ttggcagcat cacccataat tgtggtttca 2100 

aaatcggctc cgtcgatact atgttatacg ccaactttga aaacaacttt gaaaaagctg 2160 

ttttctggta tttaaggttt tagaatgcaa ggaacagtga attggagttc gtcttgttat 2220 

aattagcttc ttggggtatc tttaaatact gtagaaaaga ggaaggaaat aataaatggc 2280 

taaaatgaga atatcaccgg aattgaaaaa actgatcgaa aaataccgct gcgtaaaaga 2340 

tacggaagga atgtctcctg ctaaggtata taagctggtg ggagaaaatg aaaacctata 2400 

tttaaaaatg acggacagcc ggtataaagg gaccacctat gatgtggaac gggaaaagga 24 60 
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catgatgcta tggctggaag gaaagctgcc tgttccaaag gtcctgcact ttgaacggca 2520 

tgatggctgg agcaatctgc tcatgagtga ggccgatggc gtcctttgct cggaagagta 2580 

tgaagatgaa caaagccctg aaaagattat cgagctgtat gcggagtgca tcaggctctt 2640 

tcactccatc gacatatcgg attgtcccta tacgaatagc ttagacagcc gcttagccga 2700 

attggattac ttactgaata acgatctggc cgatgtggat tgcgaaaact gggaagaaga 2760 

cactccattt aaagatccgc gcgagctgta tgatttttta aagacggaaa agcccgaaga 2820 

ggaacttgtc ttttcccacg gcgacctggg agacagcaac atctttgtga aagatggcaa 2880 

agtaagtggc tttattgatc ttgggagaag cggcagggcg gacaagtggt atgacattgc 2940 

cttctgcgtc cggtcgatca gggaggatat cggggaagaa cagtatgtcg agctattttt 3000 

tgacttactg gggatcaagc ctgattggga gaaaataaaa tattatattt tactggatga 3060 

attgttttag tacctagatg tggcgcaacg atgccggcga caagcaggag cgcaccgact 3120 

tcttccgcat caagtgtttt ggctctcagg ccgaggccca cggcaagtat ttgggcaagg 3180 

ggtcgctggt attcgtgcag ggcaagattc ggaataccaa gtacgagaag gacggccaga 3240 

cggtctacgg gaccgacttc attgccgata aggtggatta tctggacacc aaggcaccag 3300 

gcgggtcaaa tcaggaataa gggcacattg ccccggcgtg agtcggggca atcccgcaag 3360 

gagggtgaat gaatcggacg tttgaccgga aggcatacag gcaagaactg atcgacgcgg 3420 

ggttttccgc cgaggatgcc gaaaccatcg caagccgcac cgtcatgcgt gcgccccgcg 3480 

aaaccttcca gtccgtcggc tcgatggtcc agcaagctac ggccaagatc gagcgcgaca 3540 

gcgtgcaact ggctccccct gccctgcccg cgccatcggc cgccgtggag cgttcgcgtc 3600 

gtctcgaaca ggaggcggca ggtttggcga agtcgatgac catcgacacg cgaggaacta 3660 
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tgacgaccaa gaagcgaaaa accgccggcg aggacctggc aaaacaggtc agcgaggcca 3720 

agcaggccgc gttgctgaaa cacacgaagc agcagatcaa ggaaatgcag ctttccttgt 3780 

tcgatattgc gccgtggccg gacacgatgc gagcgatgcc aaacgacacg gcccgctctg 3840 

ccctgttcac cacgcgcaac aagaaaatcc cgcgcgaggc gctgcaaaac aaggtcattt 3900 

tccacgtcaa caaggacgtg aagatcacct acaccggcgt cgagctgcgg gccgacgatg 3960 

acgaactggt gtggcagcag gtgttggagt acgcgaagcg cacccctatc ggcgagccga 4020 

tcaccttcac gttctacgag ctttgccagg acctgggctg gtcgatcaat ggccggtatt 4080 

acacgaaggc cgaggaatgc ctgtcgcgcc tacaggcgac ggcgatgggc ttcacgtccg 4140 

accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc ctggaccgtg 4200 

gcaagaaaac gtcccgttgc caggtcctga tcgacgagga aatcgtcgtg ctgtttgctg 4260 

gcgaccacta cacgaaattc atatgggaga agtaccgcaa gctgtcgccg acggcccgac 4320 

ggatgttcga ctatttcagc tcgcaccggg agccgtaccc gctcaagctg gaaaccttcc 4380 

gcctcatgtg cggatcgga-t tccacccgcg tgaagaagtg gcgcgagcag gtcggcgaag 4440 

cctgcgaaga gttgcgaggc agcggcctgg tggaacacgc ctgggtcaat gatgacctgg 4500 

tgcattgcaa acgctagggc cttgtggggt cagttccggc tgggggttca gcagccagcg 4560 

ctttactggc atttcaggaa caagcgggca ctgctcgacg cacttgcttc gctcagtatc 4 620 

gctcgggacg cacggcgcgc tctacgaact gccgataaac agaggattaa aattgacaat 4 680 

tgtgattaag gctcagattc gacggcttgg agcggccgac gtgcaggatt tccgcgagat 4740 

ccgattgtcg gccctgaaga aagctccaga gatgttcggg tccgtttacg agcacgagga 4800 

gaaaaagccc atggaggcgt tcgctgaacg gttgcgagat gccgtggcat tcggcgccta 4860 

catcgacggc gagatcattg ggctgtcggt cttcaaacag gaggacggcc ccaaggacgc 4920 
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tcacaaggcg catctgtccg gcgttttcgt ggagcccgaa cagcgaggcc gaggggtcgc 4 980 

cggtatgctg ctgcgggcgt tgccggcggg tttattgctc gtgatgatcg tccgacagat 5040 

tccaacggga atctggtgga tgcgcatctt catcctcggc gcacttaata tttcgctatt 5100 

ctggagcttg ttgtttattt cggtctaccg cctgccgggc ggggtcgcgg cgacggtagg 5160 

cgctgtgcag ccgctgatgg tcgtgttcat ctctgccgct ctgctaggta gcccgatacg 5220 

attgatggcg gtcctggggg ctatttgcgg aactgcgggc gtggcgctgt tggtgttgac 5280 

accaaacgca gcgctagatc ctgtcggcgt cgcagcgggc ctggcggggg cggtttccat 5340 

ggcgttcgga accgtgctga cccgcaagtg gcaacctccc gtgcctctgc tcacctttac 5400 

cgcctggcaa ctggcggccg gaggacttct gctcgttcca gtagctttag tgtttgatcc 5460 

gccaatcccg atgcctacag gaaccaatgt tctcggcctg gcgtggctcg gcctgatcgg 5520 

agcgggttta acctacttcc tttggttccg ggggatctcg cgactcgaac ctacagttgt 5580 

ttccttactg ggctttctca gccccagatc tggggtcgat cagccgggga tgcatcaggc 5640 

cgacagtcgg aacttcgggt ccccgacctg taccattcgg tgagcaatgg ataggggagt 5700 

tgatatcgtc aacgttcact tctaaagaaa tagcgccact cagcttcctc agcggcttta 5760 

tccagcgatt tcctattatg tcggcatagt tctcaagatc gacagcctgt cacggttaag 5820 

cgagaaatga ataagaaggc tgataattcg gatctctgcg agggagatga tatttgatca 5880 

caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga gatcatccgt 5940 

gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac atgagcaaag 6000 

tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg ctgcctgtat 6060 

cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct ggtggcagga 6120 
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tatattgtgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg cggacgtttt 6180 

taatgtactg gggtggtttt tcttttcacc agtgagacgg gcaacagctg attgcccttc 6240 

accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtttgccc cagcaggcga 6300 

aaatcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca aaagaatagc 6360 

ccgagatagg gttgagtgtt gttccagttt ggaacaagag tccactatta aagaacgtgg 6420 

actccaacgt caaagggcga aaaaccgtct atcagggcga tggcccacta cgtgaaccat 6480 

cacccaaatc aagttttttg gggtcgaggt gccgtaaagc actaaatcgg aaccctaaag 6540 

ggagcccccg atttagagct tgacggggaa agccggcgaa cgtggcgaga aaggaaggga 6600 

agaaagcgaa aggagcgggc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg 6660 

cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt 6720 

tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattcgag 6780 

ctcggtaccc ggggatcttt cgacactgaa atacgtcgag cctgctccgc ttggaagcgg 6840 

cgaggagcct cgtcctgtea caactaccaa catggagtac gataagggcc agttccgcca 6900 

gctcattaag agccagttca tgggcgttgg catgatggcc gtcatgcatc tgtacttcaa 6960 

gtacaccaac gctcttctga tccagtcgat catccgctga aggcgctttc gaatctggtt 7020 

aagatccacg tcttcgggaa gccagcgact ggtgacctcc agcgtccctt taaggctgcc 7080 

aacagctttc tcagccaggg ccagcccaag accgacaagg cctccctcca gaacgccgag 7140 

aagaactgga ggggtggtgt caaggaggag taagctcctt attgaagtcg gaggacggag 7200 

cggtgtcaag aggatattct tcgactctgt attatagata agatgatgag gaattggagg 7260 

tagcatagct tcatttggat ttgctttcca ggctgagact ctagcttgga gcatagaggg 7320 

tcctttggct ttcaatattc tcaagtatct cgagtttgaa cttattccct gtgaaccttt 7380 
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tattcaccaa tgagcattgg aatgaacatg aatctgagga ctgcaatcgc catgaggttt 7440 

tcgaaataca tccggatgtc gaaggcttgg ggcacctgcg ttggttgaat ttagaacgtg 7500 

gcactattga tcatccgata gctctgcaaa gggcgttgca caatgcaagt caaacgttgc 7560 

tagcagttcc aggtggaatg ttatgatgag cattgtatta aatcaggaga tatagcatga 7620 

tctctagtta gctcaccaca aaagtcagac ggcgtaacca aaagtcacac aacacaagct 7680 

gtaaggattt cggcacggct acggaagacg gagaagccac cttcagtgga ctcgagtacc 774 0 

atttaattct atttgtgttt gatcgagacc taatacagcc cctacaacga ccatcaaagt 7800 

cgtatagcta ccagtgagga agtggactca aatcgacttc agcaacatct cctggataaa 7860 

ctttaagcct aaactataca gaataagata ggtggagagc ttataccgag ctcccaaatc 7920 

tgtccagatc atggttgacc ggtgcctgga tcttcctata gaatcatcct tattcgttga 7980 

cctagctgat tctggagtga cccagagggt catgacttga gcctaaaatc cqccqcctcc 8040 

accatttgta gaaaaatgtg acgaactcgt gagctctgta cagtgaccgg tgactctttc 8100 

tggcatgcgg agagacggac ggacgcagag agaagggctg agtaataagc cactggccag 8160 

acagctctgg cggctctgag gtgcagtgga tgattattaa tccgggaccg gccgcccctc 8220 

cgccccgaag tggaaaggct ggtgtgcccc tcgttgacca agaatctatt gcatcatcgg 8280 

agaatatgga gcttcatcga atcaccggca gtaagcgaag gagaatgtga agccaggggt 8340 

gtatagccgt cggcgaaata gcatgccatt aacctaggta cagaagtcca attgcttccg 8400 

atctggtaaa agattcacga gatagtacct tctccgaagt aggtagagcg agtacccggc 8460 

gcgtaagctc cctaattggc ccatccggca tctgtagggc gtccaaatat cgtgcctctc 8520 

ctgctttgcc cggtgtatga aaccggaaag gccgctcagg agctggccag cggcgcagac 8580 
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cgggaacaca agctggcagt cgacccatcc ggtgctctgc actcgacctg ctgaggtccc 8640 

tcagtccctg gtaggcagct ttgccccgtc tgtccgcccg gtgtgtcggc ggggttgaca 8700 

aggtcgttgc gtcagtccaa catttgttgc catattttcc tgctctcccc accagctgct 8760 

cttttctttt ctctttcttt tcccatcttc agtatattca tcttcccatc caagaacctt 8820 

tatttcccct aagtaagtac tttgctacat ccatactcca tccttcccat cccttattcc 8880 

tttgaacctt tcagttcgag ctttcccact tcatcgcagc ttgactaaca gctaccccgc 8940 

ttgagcagac atcaccatgc ctgaactcac cgcgacgtct gtcgagaagt ttctgatcga 9000 

aaagttcgac agcgtctccg acctgatgca gctctcggag ggcgaagaat ctcgtgcttt 9060 

cagcttcgat gtaggagggc gtggatatgt cctgcgggta aatagctgcg ccgatggttt 9120 

ctacaaagat cgttatgttt atcggcactt tgcatcggcc gcgctcccga ttccggaagt 9180 

gcttgacatt ggggaattca gcgagagcct gacctattgc atctcccgcc gtgcacaggg 9240 

tgtcacgttg caagacctgc ctgaaaccga actgcccgct gttctgcagc cggtcgcgga 9300 

ggccatggat gcgatcgctg cggccgatct tagccagacg agcgggttcg gcccattcgg 9360 

accgcaagga atcggtcaat acactacatg gcgtgatttc atatgcgcga ttgctgatcc 9420 

ccatgtgtat cactggcaaa ctgtgatgga cgacaccgtc agtgcgtccg tcgcgcaggc 9480 

tctcgatgag ctgatgcttt gggccgagga ctgccccgaa gtccggcacc tcgtgcacgc 9540 

ggatttcggc tccaacaatg tcctgacgga caatggccgc ataacagcgg tcattgactg 9600 

gagcgaggcg atgttcgggg attcccaata cgaggtcgcc aacatcttct tctggaggcc 9660 

gtggttggct tgtatggagc agcagacgcg ctacttcgag cggaggcatc cggagcttgc 9720 

aggatcgccg cggctccggg cgtatatgct ccgcattggt cttgaccaac tctatcagag 9780 

cttggttgac ggcaatttcg atgatgcagc ttgggcgcag ggtcgatgcg acgcaatcgt 9840 
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ccgatccgga gccgggactg tcgggcgtac acaaatcgcc cgcagaagcg cggccgtctg 9900 

gaccgatggc tgtgtagaag tactcgccga tagtggaaac cgacgcccca gcactcgtcc 9960 

gagggcaaag gaatagagta gatgccgacc gcgggatcga tccacttaac gttactgaaa 10020 

tcatcaaaca gcttgacgaa tctggatata agatcgttgg tgtcgatgtc agctccggag 10080 

ttgagacaaa tggtgttcag gatctcgata agatacgttc atttgtccaa gcagcaaaga 10140 

gtgccttcta gtgatttaat agctccatgt caacaagaat aaaacgcgtt ttcgggttta 10200 

cctcttccag atacagctca tctgcaatgc attaatgcat tgactgcaac ctagtaacgc 10260 

cttncaggct ccggcgaaga gaagaatagc ttagcagagc tattttcatt ttcgggagac 10320 

gagatcaagc agatcaacgg tcgtcaagag acctacgaga ctgaggaatc cgctcttggc 10380 

tccacgcgac tatatatttg tctctaattg tactttgaca tgctcctctt ctttactctg 10440 

atagcttgac tatgaaaatt ccgtcaccag cncctgggtt cgcaaagata attgcatgtt 10500 

tcttccttga actctcaagc ctacaggaca cacattcatc gtaggtataa acctcgaaat 10560 

canttcctac taagatggta tacaatagta accatgcatg gttgcctagt gaatgctccg 10620 

taacacccaa tacgccggcc gaaacttttt tacaactctc ctatgagtcg tttacccaga 10680 

atgcacaggt acacttgttt agaggtaatc cttctttcta gctagaagtc ctcgtgtact 10740 

gtgtaagcgc ccactccaca tctccactcg acctgcaggc atgcaagctt agagataaaa 10800 

taaaaagaga agaaaagaaa gtttgtacaa tttctttttg tttatataac atacacgcta 10860 

tgtcaacatt tagaataagg gggaaaaaat cttccatcat attcgaatgc acaagattat 10920 

ttctttgttc gctctttttg gtcgggtcat cgagatttag agtgtaatca aagatactgt 10980 

catctcgaga gcgttgcaca ggctgctgtt tgccaaattg gatgtttgcc gaattagtaa 11040 
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aatacgcaag catttcttac ctttccgctc ccttttccta attctcccaa agactaaatg 11100 

aggaaagata aaggacaaag aaaatgtaaa gacaaagaaa ttgaaaacga tataaacttg 11160 

cagcacgtaa gaccaaagca aattggtaac tattcttgtg tacaaacatg tataaaaaaa 11220 

aacttttttt tgctcctgga ggacaaaatt tcaaactcct tgaagaagat tgcttgtata 11280 

tctatcatat gcatatatca tatcgatgga aaaagaaagt caggcatgta tttataaaaa 11340 

gaagaatgtg ccatgcttcc gaatttcttt tcactttctt ttccttatct attttaatct 11400 

catgctgtcg aagctgcagt caatcagcgt caaggcccgc cgcgttgaac tagcccgcga 11460 

catcacgcgg cccaaagtct gcctgcatgc tcagcggtgc tcgttagttc ggctgcgagt 11520 

ggcagcacca cagacagagg aggcgctggg aaccgtgcag gctgccggcg cgggcgatga 11580 

gcacagcgcc gatgtagcac tccagcagct tgaccgggct atcgcagagc gtcgtgcccg 11640 

gcgcaaacgg gagcagctgt cataccaggc tgccgccatt gcagcatcaa ttggcgtgtc 11700 

aggcattgcc atcttcgcca cctacctgag atttgccatg cacatgaccg tgggcggcgc 11760 

agtgccatgg ggtgaagtgg ctggcactct cctcttggtg gttggtggcg cgctcggcat 11820 

ggagatgtat gcccgctatg cacacaaagc catctggcat gagtcgcctc tgggctggct 11880 

gctgcacaag agccaccaca cacctcgcac tggacccttt gaagccaacg acttgtttgc 11940 

aatcatcaat ggactgcccg ccatgctcct gtgtaccttt ggcttctggc tgcccaacgt 12000 

cctgggggcg gcctgctttg gagcggggct gggcatcacg ctatacggca tggcatatat 12060 

gtttgtacac gatggcctgg tgcacaggcg ctttcccacc gggcccatcg ctggcctgcc 12120 

ctacatgaag cgcctgacag tggcccacca gctacaccac agcggcaagt acggtggcgc 12180 

gccctggggt atgttcttgg gtccacagga gctgcagcac attccaggtg cggcggagga 12240 

ggtggagcga ctggtcctgg aactggactg gtccaagcgg tagaagcttg gcgtaatcat 12300 
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ggtcatagct gtttcctgtg tgaaattgtt atccgctcac aattccacac aacatacgag 12360 

ccggaagcat aaagtgtaaa gcctggggtg cctaatgagt gagctaactc acattaattg 12420 

cgttgcgctc actgcccgct ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa 12480 

tcggccaacg cgcggggaga ggcggtttgc gtattgggcc aaagacaaaa gggcgacatt 12540 

caaccgattg agggagggaa ggtaaatatt gacggaaatt attcattaaa ggtgaattat 12600 

caccgtcacc gacttgagcc atttgggaat tagagccagc aaaatcacca gtagcaccat 12660 

taccattagc aaggccggaa acgtcaccaa tgaaaccatc gatagcagca ccgtaatcag 12720 

tagcgacaga atcaagtttg cctttagcgt cagactgtag cgcgttttca tcggcatttt 12780 

cggtcatagc ccccttatta gcgtttgcca tcttttcata atcaaaatca ccggaaccag 12840 

agccaccacc ggaaccgcct ccctcagagc cgccaccctc agaaccgcca ccctcagagc 12900 

caccaccctc agagccgcca ccagaaccac caccagagcc gccgccagca ttgacaggag 12960 

gcccgatcta gtaacataga tgacaccgcg cgcgataatt tatcctagtt tgcgcgctat 13020 

attttgtttt ctatcgcgta ttaaatgtat aattgcggga ctctaatcat aaaaacccat 13080 

ctcataaata acgtcatgca ttacatgtta attattacat gcttaacgta attcaacaga 13140 

aattatatga taatcatcgc aagaccggca acaggattca atcttaagaa actttattgc 13200 

caaatgtttg aacgatcggg gatcatccgg gtctgtggcg ggaactccac gaaaatatcc 13260 

gaacgcagca agatatcgcg gtgcatctcg gtcttgcctg ggcagtcgcc gccgacgccg 13320 

ttgatgtgga cgccgggccc gatcatattg tcgctcagga tcgtggcgtt gtgcttgtcg 13380 

gccgttgctg tcgtaatgat atcggcacct tcgaccgcct gttccgcaga gatcccgtgg 13440 

gcgaagaact ccagcatgag atccccgcgc tggaggatca tccagccggc gtcccggaaa 13500 
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acgattccga agcccaacct ttcatagaag gcggcggtgg aatcgaaatc tcgtgatggc 13560 

aggttgggcg tcgcttggtc ggtcatttcg aaccccagag tcccgctcag aagaactcgt 13620 

caagaaggcg atagaaggcg atgcgctgcg aatcgggagc ggcgataccg taaagcacga 13680 

ggaagcggtc agcccattcg ccgccaagct cttcagcaat atcacgggta gccaacgcta 13740 

tgtcctgata gcggtccgcc acacccagcc ggccacagtc gatgaatcca gaaaagcggc 13800 

cattttccac catgatattc ggcaagcagg catcgccatg ggtcacgacg agatcatcgc 13860 

cgtcgggcat gcgcgccttg agcctggcga acagttcggc tggcgcgagc ccctgatgct 13920 

cttcgtccag atcatcctga tcgacaagac cggcttccat ccgagtacgt gctcgctcga 13980 

tgcgatgttt cgcttggtgg tcgaatgggc aggtagccgg atcaagcgta tgcagccgcc 14040 

gcattgcatc agccatgatg gatactttct cggcaggagc aaggtgagat gacaggagat 14100 

cctgccccgg cacttcgccc aatagcagcc agtcccttcc cgcttcagtg acaacgtcga 14160 

gcacagctgc gcaaggaacg cccgtcgtgg ccagccacga tagccgcgct gcctcgtcct 14220 

gcagttcatt cagggcaccg gacaggtcgg tcttgacaaa aagaaccggg cgcccctgcg 14280 

ctgacagccg gaacacggcg gcatcagagc agccgattgt ctgttgtgcc cagtcatagc 14340 

cgaatagcct ctccacccaa gcggccggag aacctgcgtg caatccatct tgttcaatca 14400 

tgcgaaacga tccagatccg gtgcagatta tttggattga gagtgaatat gagactctaa 144 60 

ttggataccg aggggaattt atggaacgtc agtggagcat ttttgacaag aaatatttgc 14520 

tagctgatag tgaccttagg cgacttttga acgcgcaata atggtttctg acgtatgtgc 14580 

ttagctcatt aaactccaga aacccgcggc tgagtggctc cttcaacgtt gcggttctgt 14640 

cagttccaaa cgtaaaacgg cttgtcccgc gtcatcggcg ggggtcataa cgtgactccc 14700 

ttaattctcc gctcatgatc agattgtcgt ttcccgcctt cagtttaaac tatcagtgtt 14760 
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tgacaggata tattggcggg taaacctaag agaaaagagc gtttattaga ataatcggat 14820 

atttaaaagg gcgtgaaaag gtttatccgt tcgtccattt gtatgtgcat. gccaaccaca 14880 

gggttcccca gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg 14940 

atccgacagc gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag 15000 

cagaatgcca tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg 15060 

cagcaccggc ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac 15120 

gatcaggggt atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg 15180 

aacgcgcgga ttctttatca ctgataagtt ggtggacata ttatgtttat cagtgataaa 15240 

gtgtcaagca tgacaaagtt gcagccgaat acagtgatcc gtgccgccct ggacctgttg 15300 

aacgaggtcg gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag 15360 

cagccggcgc tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa 15420 

gccatgctgg cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca 15480 

tttctgatcg ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg 15540 

cgcatccatg ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt 15600 

cgcttcctct gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc 15660 

agctacttca ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag 15720 

cgcggcggca ccgttgaaca ggctccgctc tcgccgctgt tgcgggccgc gatagacgcc 15780 

ttcgacgaag ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga 15840 

ttggcgaaaa ggaggctcgt tgtcaggaac gttgaaggac cgagaaaggg tgacgattga 15900 

tcaggaccgc tgccggagcg caacccactc actacagcag agccatgtag acaacatccc 15960 
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ctcccccttt ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct 16020 

gccctagcgt ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt 16080 

ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag 16140 

ctcactcaaa ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca 16200 

tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt 16260 

tccataggct ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc 16320 

gaaacccgac aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct 16380 

ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg 16440 

tggcgctttt ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtatat 16500 

ccatcctttt tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg 16560 

tgtatccaac ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtgt 16620 

tccttcttca ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga 16680 

ggctggccgg ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag 16740 

ccaaccagga agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg 16800 

attgaggaaa aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc 16860 

cagggctaca. aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc 16920 

aatggcgacc tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc 16980 

acggcgcggt tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag 17040 

gacgagcttg gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt 17100 

tagccgctaa aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat 17160 

caagaagagc gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga 17220 
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misc_f eature 
(3471) . . (3471) 
n is a, c, g, or t 

misc_f eature 
(3679) . . (3679) 
n is a, c, g, or t 

misc__f eature - 
(3770) . . (3770) 
n is a, c, g, or t 



gatctttcga cactgaaata cgtcgagcct gctccgcttg gaagcggcga ggagcctcgt 



60 



cctgtcacaa ctaccaacat ggagtacgat aagggccagt tccgccagct cattaagagc 



120 



cagttcatgg gcgttggcat gatggccgtc atgcatctgt acttcaagta caccaacgct 



180 



cttctgatcc agtcgatcat ccgctgaagg cgctttcgaa tctggttaag atccacgtct 



240 



tcgggaagcc agcgactggt gacctccagc gtccctttaa ggctgccaac agctttctca 



300 



gccagggcca gcccaagacc gacaaggcct ccctccagaa cgccgagaag aactggaggg 



360 
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gtggtgtcaa ggaggagtaa gctccttatt gaagtcggag gacggagcgg tgtcaagagg 420 

atattcttcg actctgtatt atagataaga tgatgaggaa ttggaggtag catagcttca 480 

tttggatttg ctttccaggc tgagactcta gcttggagca tagagggtcc tttggctttc 540 

aatattctca agtatctcga gtttgaactt attccctgtg aaccttttat tcaccaatga 600 

gcattggaat gaacatgaat ctgaggactg caatcgccat gaggttttcg aaatacatcc 660 

ggatgtcgaa ggcttggggc acctgcgttg gttgaattta gaacgtggca ctattgatca 720 

tccgatagct ctgcaaaggg cgttgcacaa tgcaagtcaa acgttgctag cagttccagg 780 

tggaatgtta tgatgagcat tgtattaaat caggagatat agcatgatct ctagttagct 840 

caccacaaaa gtcagacggc gtaaccaaaa gtcacacaac acaagctgta aggatttcgg 900 

cacggctacg gaagacggag aagccacctt cagtggactc gagtaccatt taattctatt 960 

tgtgtttgat cgagacctaa tacagcccct acaacgacca tcaaagtcgt atagctacca 1020 

gtgaggaagt ggactcaaat cgacttcagc aacatctcct ggataaactt taagcctaaa 1080 

ctatacagaa taagataggt ggagagctta taccgagctc ccaaatctgt ccagatcatg 1140 

gttgaccggt gcctggatct tcctatagaa tcatccttat tcgttgacct agctgattct 1200 

ggagtgaccc agagggtcat gacttgagcc taaaatccgc cgcctccacc atttgtagaa 1260 

aaatgtgacg aactcgtgag ctctgtacag tgaccggtga ctctttctgg catgcggaga 1320 

gacggacgga cgcagagaga agggctgagt aataagccac tggccagaca gctctggcgg 1380 

ctctgaggtg cagtggatga ttattaatcc gggaccggcc gcccctccgc cccgaagtgg 1440 

aaaggctggt gtgcccctcg ttgaccaaga atctattgca tcatcggaga atatggagct 1500 

tcatcgaatc accggcagta agcgaaggag aatgtgaagc caggggtgta tagccgtcgg 1560 

cgaaatagca tgccattaac ctaggtacag aagtccaatt gcttccgatc tggtaaaaga 1620 
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ttcacgagat agtaccttct ccgaagtagg tagagcgagt acccggcgcg taagctccct 1680 

aattggccca tccggcatct gtagggcgtc caaatatcgt gcctctcctg ctttgcccgg 1740 

tgtatgaaac cggaaaggcc gctcaggagc tggccagcgg cgcagaccgg gaacacaagc 1800 

tggcagtcga cccatccggt gctctgcact cgacctgctg aggtccctca gtccctggta 1860 

ggcagctttg ccccgtctgt ccgcccggtg tgtcggcggg gttgacaagg tcgttgcgtc 1920 

agtccaacat ttgttgccat attttcctgc tctccccacc agctgctctt ttcttttctc 1980 

tttcttttcc catcttcagt atattcatct tcccatccaa gaacctttat ttcccctaag 2040 

taagtacttt gctacatcca tactccatcc ttcccatccc ttattccttt gaacctttca 2100 

gttcgagctt tcccacttca tcgcagcttg actaacagct accccgcttg agcagacatc 2160 

accatgcctg aactcaccgc gacgtctgtc gagaagtttc tgatcgaaaa gttcgacagc 2220 

gtctccgacc tgatgcagct ctcggagggc gaagaatctc gtgctttcag cttcgatgta 2280 

ggagggcgtg gatatgtcct gcgggtaaat agctgcgccg atggtttcta caaagatcgt 2340 

tatgtttatc ggcactttgc atcggccgcg ctcccgattc cggaagtgct tgacattggg 2400 

gaattcagcg agagcctgac ctattgcatc tcccgccgtg cacagggtgt cacgttgcaa 2460 

gacctgcctg aaaccgaact gcccgctgtt ctgcagccgg tcgcggaggc catggatgcg 2520 

atcgctgcgg ccgatcttag ccagacgagc gggttcggcc cattcggacc gcaaggaatc 2580 

ggtcaataca ctacatggcg tgatttcata tgcgcgattg ctgatcccca tgtgtatcac 2640 

tggcaaactg tgatggacga caccgtcagt gcgtccgtcg cgcaggctct cgatgagctg 2700 

atgctttggg ccgaggactg ccccgaagtc cggcacctcg tgcacgcgga tttcggctcc 2760 

aacaatgtcc tgacggacaa tggccgcata acagcggtca ttgactggag cgaggcgatg 2820 
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ttcggggatt cccaatacga ggtcgccaac atcttcttct ggaggccgtg gttggcttgt 2880 

atggagcagc agacgcgcta cttcgagcgg aggcatccgg agcttgcagg atcgccgcgg 2940 

ctccgggcgt atatgctccg cattggtctt gaccaactct atcagagctt ggttgacggc 3000 

aatttcgatg atgcagcttg ggcgcagggt cgatgcgacg caatcgtccg atccggagcc 3060 

gggactgtcg ggcgtacaca aatcgcccgc agaagcgcgg ccgtctggac cgatggctgt 3120 

gtagaagtac tcgccgatag tggaaaccga cgccccagca ctcgtccgag ggcaaaggaa 3180 

tagagtagat gccgaccgcg ggatcgatcc acttaacgtt actgaaatca tcaaacagct 3240 

tgacgaatct ggatataaga tcgttggtgt cgatgtcagc tccggagttg agacaaatgg 3300 

tgttcaggat ctcgataaga tacgttcatt tgtccaagca gcaaagagtg ccttctagtg 3360 

atttaatagc tccatgtcaa caagaataaa acgcgttttc gggtttacct cttccagata 3420 

cagctcatct gcaatgcatt aatgcattga ctgcaaccta gtaacgcctt ncaggctccg 3480 

gcgaagagaa gaatagctta gcagagctat tttcattttc gggagacgag atcaagcaga 3540 

tcaacggtcg tcaagagacc tacgagactg aggaatccgc tcttggctcc acgcgactat 3600 

atatttgtct ctaattgtac tttgacatgc tcctcttctt tactctgata gcttgactat 3660 

gaaaattccg tcaccagcnc ctgggttcgc aaagataatt gcatgtttct tccttgaact 3720 

ctcaagccta caggacacac attcatcgta ggtataaacc tcgaaatcan ttcctactaa 3780 

gatggtatac aatagtaacc atgcatggtt gcctagtgaa tgctccgtaa cacccaatac 3840 

gccggccgaa acttttttac aactctccta tgagtcgttt acccagaatg cacaggtaca 3900 

cttgtttaga ggtaatcctt ctttctagct agaagtcctc gtgtactgtg taagcgccca 3960 

ctccacatct ccactcgacc tgcaggcatg caaagcttga gattaaaata gataaggaaa 4020 

agaaagtgaa aagaaattcg gaagcatggc acattcttct ttttataaat acatgcctga 4080 
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ctttcttttt ccatcgatat gatatatgca tatgatagat atacaagcaa tcttcttcaa 4140 

ggagtttgaa attttgtcct ccaggagcaa aaaaaagttt ttttttatac atgtttgtac 4200 

acaagaatag ttaccaattt gctttggtct tacgtgctgc aagtttatat cgttttcaat 4260 

ttctttgtct ttacattttc tttgtccttt atctttcctc atttagtctt tgggagaatt 4320 

aggaaaaggg agcggaaagg taagaaatgc ttgcgtattt tactaattcg gcaaacatcc 4380 

aatttggcaa acagcagcct gtgcaacgct ctcgagatga cagtatcttt gattacactc 4 440 

taaatctcga tgacccgacc aaaaagagcg aacaaagaaa taatcttgtg cattcgaata 4 500 

tgatggaaga ttttttcccc cttattctaa atgttgacat agcgtgtatg ttatataaac 4560 

aaaaagaaat tgtacaaact ttcttttctt ctctttttat tttatctcta tgctgtcgaa 4620 

gctgcagtca atcagcgtca aggcccgccg cgttgaacta gcccgcgaca tcacgcggcc 4 680 

caaagtctgc ctgcatgctc agcggtgctc gttagttcgg ctgcgagtgg cagcaccaca 4740 

gacagaggag gcgctgggaa ccgtgcaggc tgccggcgcg ggcgatgagc acagcgccga 4 800 

tgtagcactc cagcagcttg accgggctat cgcagagcgt cgtgcccggc gcaaacggga 4 860 

gcagctgtca taccaggctg ccgccattgc agcatcaatt ggcgtgtcag gcattgccat 4 920 

cttcgccacc tacctgagat ttgccatgca catgaccgtg ggcggcgcag tgccatgggg 4 980 

tgaagtggct ggcactctcc tcttggtggt tggtggcgcg ctcggcatgg agatgtatgc 5040 

ccgctatgca cacaaagcca tctggcatga gtcgcctctg ggctggctgc tgcacaagag 5100 

ccaccacaca cctcgcactg gaccctttga agccaacgac ttgtttgcaa tcatcaatgg 5160 

actgcccgcc atgctcctgt gtacctttgg cttctggctg cccaacgtcc tgggggcggc 5220 

ctgctttgga gcggggctgg gcatcacgct atacggcatg gcatatatgt ttgtacacga 5280 
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tggcctggtg cacaggcgct ttcccaccgg gcccatcgct ggcctgccct acatgaagcg 5340 

cctgacagtg gcccaccagc tacaccacag cggcaagtac ggtggcgcgc cctggggtat 5400 

gttcttgggt ccacaggagc tgcagcacat tccaggtgcg gcggaggagg tggagcgact 54 60 

ggtcctggaa ctggactggt ccaagcggta gattgtgact gatagcgaga ctctgggtcg 5520 

atgttatctg cctcaacaat ggcttagaaa agaagaaaca gaacaaatac agcaaggcaa 5580 

cgcccgtagc ctaggtgatc aaagactgtt gggcttgtct ctgaagcttg taggaaaggc 5640 

agacgctatc atggtgagag ctaagaaggg cattgacaag ttgccggcaa actgtcaagg 5700 

cggtgtacga gctgcttgcc aagtatatgc tgcaattgga tctgtactca agcagcagaa 5760 

gacaacatat cctacaagag ctcatctaaa aggaagcgaa cgtgccaaga ttgctctgtt 5820 

gagtgtatac aacctctatc aatctgaaga caagcctgtg gctctccgtc aagctagaaa 5880 

gattaagagt ttttttgttg attagtgaat ttttgtttta tttatgtctg atagttcaat 5940 

aaagagacaa cacatacaat ataaaatcat tgtctttaaa tgttaattta gtagagtgta 6000 

aagcctgcat tttttttgta cgcataaaca atgaattcac cccgcttctg gtttttaaat 6060 

aattatgtca aactagggaa aattcttttt tttctcttcg ttcttttttt ggcttgttgt 6120 

ggagtcacag gcttgtcttc agattgatag aggttgtata cactcaacag agcaatcttg 6180 

gcacgttcgc ttccttttag atgagctctt gtaggatatg ttgtcttctg ctgcttgagt 6240 

acagatccaa ttgcagcata tacttggcaa gcagctcgta caccgccttg acagtttgcc 6300 

ggcaacttgt caatgccctt cttagctctc accatgatag cgtctgcctt tcctacaagc 6360 

ttcagagaca agcccaacag tctttgatca cctaggctac gggcgttgcc ttgctgtatt 6420 

tgttctgttt cttcttttct aagccattgt tgaggcagat aacatcgacc caacatcctc 6480 

gagccatact acagcataaa aggatacgtt ttctttaaca gaaatttacc cttttgttat 6540 
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cagcacatac aaaaaaaaag aaatttaaga 
tattcaatcc ataaatgaat tatttttgga 
tatttttttt ttttttacaa ctccaccaat 
tcatagctgt ttcctgtgtg aaattgttat 
ggaagcataa agtgtaaagc ctggggtgcc 
ttgcgctcac tgcccgcttt ccagtcggga 
ggccaacgcg cggggagagg cggtttgcgt 
accgattgag ggagggaagg taaatattga 
ccgtcaccga cttgagccat ttgggaatta 
ccattagcaa ggccggaaac gtcaccaatg 
gcgacagaat caagtttgcc tttagcgtca 
gtcatagccc ccttattagc gtttgccatc 
ccaccaccgg aaccgcctcc ctcagagccg 
ccaccctcag agccgccacc agaaccacca 
ccgatctagt aacatagatg acaccgcgcg 
tttgttttct atcgcgtatt aaatgtataa 
cataaataac gtcatgcatt acatgttaat 
ttatatgata atcatcgcaa gaccggcaac 
aatgtttgaa cgatcgggga tcatccgggt 
acgcagcaag atatcgcggt gcatctcggt 
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tgagtaggac ttccattctc tcaaaaattt 6600 

caaaaaagaa agattatgcc tgattttctc 6660 

actttctagc ccagcttggc gtaatcatgg 6720 

ccgctcacaa ttccacacaa catacgagcc 6780 

taatgagtga gctaactcac attaattgcg 6840 

aacctgtcgt gccagctgca ttaatgaatc 6900 

attgggccaa agacaaaagg gcgacattca 6960 

cggaaattat tcattaaagg tgaattatca 7020 

gagccagcaa aatcaccagt agcaccatta 7080 

aaaccatcga tagcagcacc gtaatcagta 7140 

gactgtagcg cgttttcatc ggcattttcg 7200 

ttttcataat caaaatcacc ggaaccagag 7260 

ccaccctcag aaccgccacc ctcagagcca 7320 

ccagagccgc cgccagcatt gacaggaggc 7380 

cgataattta tcctagtttg cgcgctatat 7440 

ttgcgggact ctaatcataa aaacccatct 7500 

tattacatgc ttaacgtaat tcaacagaaa 7560 

aggattcaat cttaagaaac tttattgcca 7620 

ctgtggcggg aactccacga aaatatccga 7680 

cttgcctggg cagtcgccgc cgacgccgtt 7740 
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gatgtggacg ccgggcccga tcatattgtc 

cgttgctgtc gtaatgatat cggcaccttc 

gaagaactcc agcatgagat ccccgcgctg 

gattccgaag cccaaccttt catagaaggc 

gttgggcgtc gcttggtcgg tcatttcgaa 

agaaggcgat agaaggcgat gcgctgcgaa 

aagcggtcag cccattcgcc gccaagctct 

tcctgatagc ggtccgccac acccagccgg 

ttttccacca tgatattcgg caagcaggca 

tcgggcatgc gcgccttgag cctggcgaac 

tcgtccagat catcctgatc gacaagaccg 

cgatgtttcg cttggtggtc gaatgggcag 

attgcatcag ccatgatgga tactttctcg 

tgccccggca cttcgcccaa tagcagccag 

acagctgcgc aaggaacgcc cgtcgtggcc 

agttcattca gggcaccgga caggtcggtc 

gacagccgga acacggcggc atcagagcag 

aatagcctct ccacccaagc ggccggagaa 

cgaaacgatc cagatccggt gcagattatt 

ggataccgag gggaatttat ggaacgtcag 

gctgatagtg accttaggcg acttttgaac 
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gctcaggatc gtggcgttgt gcttgtcggc 7800 

gaccgcctgt tccgcagaga tcccgtgggc 7860 

gaggatcatc cagccggcgt cccggaaaac 7 920 

ggcggtggaa tcgaaatctc gtgatggcag 7980 

ccccagagtc ccgctcagaa gaactcgtca 8040 

tcgggagcgg cgataccgta aagcacgagg 8100 

tcagcaatat cacgggtagc caacgctatg 8160 

ccacagtcga tgaatccaga aaagcggcca 8220 

tcgccatggg tcacgacgag atcatcgccg 8280 

agttcggctg gcgcgagccc ctgatgctct 8340 

gcttccatcc gagtacgtgc tcgctcgatg 8400 

gtagccggat caagcgtatg cagccgccgc 84 60 

gcaggagcaa ggtgagatga caggagatcc 8520 

tcccttcccg cttcagtgac aacgtcgagc 8580 

agccacgata gccgcgctgc ctcgtcctgc 8640 

ttgacaaaaa gaaccgggcg cccctgcgct 8700 

ccgattgtct gttgtgccca gtcatagccg 8760 

cctgcgtgca atccatcttg ttcaatcatg 8820 

tggattgaga gtgaatatga gactctaatt 8880 

tggagcattt ttgacaagaa atatttgcta 8940 

gcgcaataat ggtttctgac gtatgtgctt 9000 
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agctcattaa actccagaaa cccgcggctg agtggctcct tcaacgttgc ggttctgtca 9060 

gttccaaacg taaaacggct tgtcccgcgt catcggcggg ggtcataacg tgactccctt 9120 

aattctccgc tcatgatcag attgtcgttt cccgccttca gtttaaacta tcagtgtttg 9180 

acaggatata ttggcgggta aacctaagag aaaagagcgt ttattagaat aatcggatat 9240 

ttaaaagggc gtgaaaaggt ttatccgttc gtccatttgt atgtgcatgc caaccacagg 9300 

gttccccaga tctggcgccg gccagcgaga cgagcaagat tggccgccgc ccgaaacgat 9360 

ccgacagcgc gcccagcaca ggtgcgcagg caaattgcac caacgcatac agcgccagca 9420 

gaatgccata gtgggcggtg acgtcgttcg agtgaaccag atcgcgcagg aggcccggca 9480 

gcaccggcat aatcaggccg atgccgacag cgtcgagcgc gacagtgctc agaattacga 9540 

tcaggggtat gttgggtttc acgtctggcc tccggaccag cctccgctgg tccgattgaa 9600 

cgcgcggatt ctttatcact gataagttgg tggacatatt atgtttatca gtgataaagt 9660 

gtcaagcatg acaaagttgc agccgaatac agtgatccgt gccgccctgg acctgttgaa 9720 

cgaggtcggc gtagacggtc tgacgacacg caaactggcg gaacggttgg gggttcagca 9780 

gccggcgctt tactggcact tcaggaacaa gcgggcgctg ctcgacgcac tggccgaagc 9840 

catgctggcg gagaatcata cgcattcggt gccgagagcc gacgacgact ggcgctcatt 9900 

tctgatcggg aatgcccgca gcttcaggca ggcgctgctc gcctaccgcg atggcgcgcg 9960 

catccatgcc ggcacgcgac cgggcgcacc gcagatggaa acggccgacg cgcagcttcg 10020 

cttcctctgc gaggcgggtt tttcggccgg ggacgccgtc aatgcgctga tgacaatcag 10080 

ctacttcact gttggggccg tgcttgagga gcaggccggc gacagcgatg ccggcgagcg 10140 

cggcggcacc gttgaacagg ctccgctctc gccgctgttg cgggccgcga tagacgcctt 10200 
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cgacgaagcc ggtccggacg cagcgttcga gcagggactc gcggtgattg tcgatggatt 10260 

ggcgaaaagg aggctcgttg tcaggaacgt tgaaggaccg agaaagggtg acgattgatc 10320 

aggaccgctg ccggagcgca acccactcac tacagcagag ccatgtagac aacatcccct 10380 

ccccctttcc accgcgtcag acgcccgtag cagcccgcta cgggcttttt catgccctgc 104 40 

cctagcgtcc aagcctcacg gccgcgctcg gcctctctgg cggccttctg gcgctcttcc 10500 

gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct 10560 

cactcaaagg cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg 10620 

tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc 10680 

cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga 10740 

aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct 10800 

cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg 108 60 

gcgcttttcc gctgcataac cctgcttcgg ggtcattata gcgatttttt cggtatatcc 10920 

atcctttttc gcacgatata caggattttg ccaaagggtt cgtgtagact ttccttggtg 10980 

tatccaacgg cgtcagccgg gcaggatagg tgaagtaggc ccacccgcga gcgggtgttc 11040 

cttcttcact gtcccttatt cgcacctggc ggtgctcaac gggaatcctg ctctgcgagg 11100 

ctggccggct accgccggcg taacagatga gggcaagcgg atggctgatg aaaccaagcc 11160 

aaccaggaag ggcagcccac ctatcaaggt gtactgcctt ccagacgaac gaagagcgat 11220 

tgaggaaaag gcggcggcgg ccggcatgag cctgtcggcc tacctgctgg ccgtcggcca 11280 

gggctacaaa atcacgggcg tcgtggacta tgagcacgtc cgcgagctgg cccgcatcaa 11340 

tggcgacctg ggccgcctgg gcggcctgct gaaactctgg ctcaccgacg acccgcgcac 11400 

ggcgcggttc ggtgatgcca cgatcctcgc cctgctggcg aagatcgaag agaagcagga 114 60 
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cgagcttggc aaggtcatga tgggcgtggt 

gccgctaaaa cggccggggg gtgcgcgtga 

agaagagcga cttcgcggag ctggtgaagt 

gcctttgcga cgctcaccgg gctggttgcc 

cctgcaaacg cgccagaaac gccgtcgaag 

tgtggatacc tcgcggaaaa cttggccctc 

tgaggggccg actcacccgg cgcggcgttg 

gcgacgtgga gctggccagc ctcgcaaatc 

ccacagatga tgtggacaag cctggggata 

gcgactactg acagatgagg ggcgcgatcc 

tgaggggcgc acctattgac atttgagggg 

aagggtttcc gcccgttttt cggccaccgc 

atatttataa accttgtttt taaccagggc 

aaggggggtg cccccccttc tcgaaccctc 

ccaggggctg cgcccctcgg ccgcgaacgg 

ccttgccatt gccgggatcg gggcagtaac 

cggaagcatt gacgtgccgc aggtgctggc 

tgagggcggc ggcctgggtg gcggcctgcc 

cttcatggcg gggccggcaa tttttacctt 

cgtgctcgtg ttcgggggtg cgataaaccc 
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ccgcccgagg gcagagccat gactttttta 11520 

ttgccaagca cgtccccatg cgctccatca 11580 

acatcaccga cgagcaaggc aagaccgagc 11640 

ctcgccgctg ggctggcggc cgtctatggc 11700 

ccgtgtgcga gacaccgcgg ccgccggcgt 117 60 

actgacagat gaggggcgga cgttgacact 11820 

acagatgagg ggcaggctcg atttcggccg 11880 

ggcgaaaacg cctgatttta cgcgagtttc 11940 

agtgccctgc ggtattgaca cttgaggggc 12000 

ttgacacttg aggggcagag tgctgacaga 12060 

ctgtccacag gcagaaaatc cagcatttgc 12120 

taacctgtct tttaacctgc ttttaaacca 12180 

tgcgccctgt gcgcgtgacc gcgcacgccg 12240 

ccggcccgct aacgcgggcc tcccatcccc 12300 

cctcacccca aaaatggcag cgctggcagt 12360 

gggatgggcg atcagcccga gcgcgacgcc 12420 

atcgacattc agcgaccagg tgccgggcag 12480 

cttcacttcg gccgtcgggg cattcacgga 12540 

gggcattctt ggcatagtgg tcgcgggtgc 12600 

agcgaaccat ttgaggtgat aggtaagatt 12660 
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ataccgaggt atgaaaacga gaattggacc tttacagaat tactctatga agcgccatat 12720 

ttaaaaagct accaagacga agaggatgaa gaggatgagg aggcagattg ccttgaatat 12780 

attgacaata ctgataagat aatatatctt ttatatagaa gatatcgccg tatgtaagga 12840 

tttcaggggg caaggcatag gcagcgcgct tatcaatata tctatagaat gggcaaagca 12900 

taaaaacttg catggactaa tgcttgaaac ccaggacaat aaccttatag cttgtaaatt 12960 

ctatcataat tgggtaatga ctccaactta ttgatagtgt tttatgttca gataatgccc 13020 

gatgactttg tcatgcagct ccaccgattt tgagaacgac agcgacttcc gtcccagccg 13080 

tgccaggtgc tgcctcagat tcaggrtatg ccgctcaatt cgctgcgtat atcgcttgct 13140 

gattacgtgc agctttccct tcaggcggga ttcatacagc ggccagccat ccgtcatcca 13200 

tatcaccacg tcaaagggtg acagcaggct cataagacgc cccagcgtcg ccatagtgcg 13260 

ttcaccgaat acgtgcgcaa caaccgtctt ccggagactg tcatacgcgt aaaacagcca 13320 

gcgctggcgc gatttagccc cgacatagcc ccactgttcg tccatttccg cgcagacgat 13380 

gacgtcactg cccggctgtra tgcgcgaggt taccgactgc ggcctgagtt ttttaagtga 13440 

cgtaaaatcg tgttgaggcc aacgcccata atgcgggctg ttgcccggca tccaacgcca 13500 

ttcatggcca tatcaatgat tttctggtgc gtaccgggtt gagaagcggt gtaagtgaac 13560 

tgcagttgcc atgttttacg gcagtgagag cagagatagc gctgatgtcc ggcggtgctt 13620 

ttgccgttac gcaccacccc gtcagtagct gaacaggagg gacagctgat agacacagaa 13680 

gccactggag cacctcaaaa acaccatcat acactaaatc agtaagttgg cagcatcacc 13740 

cataattgtg gtttcaaaat cggctccgtc gatactatgt tatacgccaa ctttgaaaac 13800 

aactttgaaa aagctgtttt ctggtattta aggttttaga atgcaaggaa cagtgaattg 13860 

gagttcgtct tgttataatt agcttcttgg ggtatcttta aatactgtag aaaagaggaa 13920 
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ggaaataata aatggctaaa atgagaatat caccggaatt gaaaaaactg atcgaaaaat 13980 

accgctgcgt aaaagatacg gaaggaatgt ctcctgctaa ggtatataag ctggtgggag 14040 

aaaatgaaaa cctatattta aaaatgacgg acagccggta taaagggacc acctatgatg 14100 

tggaacggga aaaggacatg atgctatggc tggaaggaaa gctgcctgtt ccaaaggtcc 14160 

tgcactttga acggcatgat ggctggagca atctgctcat gagtgaggcc gatggcgtcc 14220 

tttgctcgga agagtatgaa gatgaacaaa gccctgaaaa gattatcgag ctgtatgcgg 14280 

agtgcatcag gctctttcac tccatcgaca tatcggattg tccctatacg aatagcttag 14340 

acagccgctt agccgaattg gattacttac tgaataacga tctggccgat gtggattgcg 14400 

aaaactggga agaagacact ccatttaaag atccgcgcga gctgtatgat tttttaaaga 144 60 

cggaaaagcc cgaagaggaa cttgtctttt cccacggcga cctgggagac agcaacatct 14520 

ttgtgaaaga tggcaaagta agtggcttta ttgatcttgg gagaagcggc agggcggaca 14580 

agtggtatga cattgccttc tgcgtccggt cgatcaggga ggatatcggg gaagaacagt 14640 

atgtcgagct attttttgac ttactgggga tcaagcctga ttgggagaaa ataaaatatt 14700 

atattttact ggatgaattg ttttagtacc tagatgtggc gcaacgatgc cggcgacaag 14760 

caggagcgca ccgacttctt ccgcatcaag tgttttggct ctcaggccga ggcccacggc 14820 

aagtatttgg gcaaggggtc gctggtattc gtgcagggca agattcggaa taccaagtac 14880 

gagaaggacg gccagacggt ctacgggacc gacttcattg ccgataaggt ggattatctg . 14 940 

gacaccaagg caccaggcgg gtcaaatcag gaataagggc acattgcccc ggcgtgagtc 15000 

ggggcaatcc cgcaaggagg gtgaatgaat cggacgtttg accggaaggc atacaggcaa 15060 

gaactgatcg acgcggggtt ttccgccgag gatgccgaaa ccatcgcaag ccgcaccgtc 15120 
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atgcgtgcgc cccgcgaaac cttccagtcc gtcggctcga tggtccagca agctacggcc 15180 

aagatcgagc gcgacagcgt gcaactggct ccccctgccc tgcccgcgcc atcggccgcc 15240 

gtggagcgtt cgcgtcgtct cgaacaggag gcggcaggtt tggcgaagtc gatgaccatc 15300 

gacacgcgag gaactatgac gaccaagaag cgaaaaaccg ccggcgagga cctggcaaaa 15360 

caggtcagcg aggccaagca ggccgcgttg ctgaaacaca cgaagcagca gatcaaggaa 15420 

atgcagcttt ccttgttcga tattgcgccg tggccggaca cgatgcgagc gatgccaaac 15480 

gacacggccc gctctgccct gttcaccacg cgcaacaaga aaatcccgcg cgaggcgctg 15540 

caaaacaagg tcattttcca cgtcaacaag gacgtgaaga tcacctacac cggcgtcgag 15600 

ctgcgggccg acgatgacga actggtgtgg cagcaggtgt tggagtacgc gaagcgcacc 15660 

cctatcggcg agccgatcac cttcacgttc tacgagcttt gccaggacct gggctggtcg 15720 

atcaatggcc ggtattacac gaaggccgag gaatgcctgt cgcgcctaca ggcgacggcg 15780 

atgggcttca cgtccgaccg cgttgggcac ctggaatcgg tgtcgctgct gcaccgcttc 15840 

cgcgtcctgg accgtggcaa gaaaacgtcc cgttgccagg tcctgatcga cgaggaaatc 15900 

gtcgtgctgt ttgctggcga ccactacacg aaattcatat gggagaagta ccgcaagctg 15960 

tcgccgacgg cccgacggat gttcgactat ttcagctcgc accgggagcc gtacccgctc 16020 

aagctggaaa ccttccgcct catgtgcgga tcggattcca cccgcgtgaa gaagtggcgc 16080 

gagcaggtcg gcgaagcctg cgaagagttg cgaggcagcg gcctggtgga acacgcctgg 16140 

gtcaatgatg acctggtgca ttgcaaacgc tagggccttg tggggtcagt tccggctggg 16200 

ggttcagcag ccagcgcttt actggcattt caggaacaag cgggcactgc tcgacgcact 16260 

tgcttcgctc agtatcgctc gggacgcacg gcgcgctcta cgaactgccg ataaacagag 16320 

gattaaaatt gacaattgtg attaaggctc agattcgacg gcttggagcg gccgacgtgc 16380 
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aggatttccg cgagatccga ttgtcggccc tgaagaaagc tccagagatg ttcgggtccg 16440 

tttacgagca cgaggagaaa aagcccatgg aggcgttcgc tgaacggttg cgagatgccg 16500 

tggcattcgg cgcctacatc gacggcgaga tcattgggct gtcggtcttc aaacaggagg 16560 

acggccccaa ggacgctcac aaggcgcatc tgtccggcgt tttcgtggag cccgaacagc 16620 

gaggccgagg ggtcgccggt atgctgctgc gggcgttgcc ggcgggttta ttgctcgtga 16680 

tgatcgtccg acagattcca acgggaatct ggtggatgcg catcttcatc ctcggcgcac 16740 

ttaatatttc gctattctgg agcttgttgt ttatttcggt ctaccgcctg ccgggcgggg 16800 

tcgcggcgac ggtaggcgct gtgcagccgc tgatggtcgt gttcatctct gccgctctgc 16860 

taggtagccc gatacgattg atggcggtcc tgggggctat ttgcggaact gcgggcgtgg 16920 

cgctgttggt gttgacacca aacgcagcgc tagatcctgt cggcgtcgca gcgggcctgg 16980 

cgggggcggt ttccatggcg ttcggaaccg tgctgacccg caagtggcaa cctcccgtgc 17040 

ctctgctcac ctttaccgcc tggcaactgg cggccggagg acttctgctc gttccagtag 17100 

ctttagtgtt tgatccgcca atcccgatgc ctacaggaac caatgttctc ggcctggcgt 17160 

ggctcggcct gatcggagcg ggtttaacct acttcctttg gttccggggg atctcgcgac 17220 

tcgaacctac agttgtttcc ttactgggct ttctcagccc cagatctggg gtcgatcagc 17280 

cggggatgca tcaggccgac agtcggaact tcgggtcccc gacctgtacc attcggtgag 17340 

caatggatag gggagttgat atcgtcaacg ttcacttcta aagaaatagc gccactcagc 17400 

ttcctcagcg gctttatcca gcgatttcct attatgtcgg catagttctc aagatcgaca 17460 

gcctgtcacg gttaagcgag aaatgaataa gaaggctgat aattcggatc tctgcgaggg 17520 

agatgatatt tgatcacagg cagcaacgct ctgtcatcgt tacaatcaac atgctaccct 17580 
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ccgcgagatc atccgtgttt 
ggtaacatga gcaaagtctg 
gatgggctgc ctgtatcgag 
ctggctggtg gcaggatata 
acattgcgga cgtttttaat 
cagctgattg cccttcaccg 
ttgccccagc aggcgaaaat 
aaatcaaaag aatagcccga 
ctattaaaga acgtggactc 
ccactacgtg aaccatcacc 
aatcggaacc ctaaagggag 
gcgagaaagg aagggaagaa 
ggaagggcga tcggtgcggg 
tgcaaggcga ttaagttggg 
ggccagtgaa ttcgagctcg 

<210> 41 

<211> 18449 

<212> DNA 

<213> Artificial 

<220> 

<223> Plasmid 
<220> 
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caaacccggc agcttagttg ccgttcttcc gaatagcatc 17640 

ccgccttaca acggctctcc cgctgacgcc gtcccggact 17700 

tggtgatttt gtgccgagct gccggtcggg gagctgttgg 17760 

ttgtggtgta aacaaattga cgcttagaca acttaataac 17820 

gtactggggt ggtttttctt ttcaccagtg agacgggcaa 17880 

cctggccctg agagagttgc agcaagcggt ccacgctggt 17940 

cctgtttgat ggtggttccg aaatcggcaa aatcccttat 18000 

gatagggttg agtgttgttc cagtttggaa caagagtcca 18060 

caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc 18120 

caaatcaagt tttttggggt cgaggtgccg taaagcacta 18180 

cccccgattt agagcttgac ggggaaagcc ggcgaacgtg 18240 

agcgaaagga gcgggcgcca ttcaggctgc gcaactgttg 18300 

cctcttcgct attacgccag ctggcgaaag ggggatgtgc 18360 

taacgccagg gttttcccag tcacgacgtt gtaaaacgac 18420 

gtacccggg 18449 
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<221> misc_f eature 
<222> (3471) . . (3471) 
<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (3679) . . (3679) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (3770) . . (3770) 

<223> n is a, c, g, or t 

<400> 41 

gatctttcga cactgaaata cgtcgagcct gctccgcttg gaagcggcga ggagcctcgt 60 

cctgtcacaa ctaccaacat ggagtacgat aagggccagt tccgccagct cattaagagc 120 

cagttcatgg gcgttggcat gatggccgtc atgcatctgt acttcaagta caccaacgct 180 

cttctgatcc agtcgatcat ccgctgaagg cgctttcgaa tctggttaag atccacgtct 240 

tcgggaagcc agcgactggt gacctccagc gtccctttaa ggctgccaac agctttctca 300 

gccagggcca gcccaagacc gacaaggcct ccctccagaa cgccgagaag aactggaggg 360 

gtggtgtcaa ggaggagtaa gctccttatt gaagtcggag gacggagcgg tgtcaagagg 420 

atattcttcg actctgtatt atagataaga tgatgaggaa ttggaggtag catagcttca 480 

tttggatttg ctttccaggc tgagactcta gcttggagca tagagggtcc tttggctttc 540 

aatattctca agtatctcga gtttgaactt attccctgtg aaccttttat tcaccaatga 600 

gcattggaat gaacatgaat ctgaggactg caatcgccat gaggttttcg aaatacatcc 660 

ggatgtcgaa ggcttggggc acctgcgttg gttgaattta gaacgtggca ctattgatca 720 

tccgatagct ctgcaaaggg cgttgcacaa tgcaagtcaa acgttgctag cagttccagg 780 
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tggaatgtta tgatgagcat tgtattaaat caggagatat agcatgatct ctagttagct 840 

caccacaaaa gtcagacggc gtaaccaaaa gtcacacaac acaagctgta aggatttcgg 900 

cacggctacg gaagacggag aagccacctt cagtggactc gagtaccatt taattctatt 960 

tgtgtttgat cgagacctaa tacagcccct acaacgacca tcaaagtcgt atagctacca 1020 

gtgaggaagt ggactcaaat cgacttcagc aacatctcct ggataaactt taagcctaaa 1080 

ctatacagaa taagataggt ggagagctta taccgagctc ccaaatctgt ccagatcatg 1140 

gttgaccggt gcctggatct tcctatagaa tcatccttat tcgttgacct agctgattct 1200 

ggagtgaccc agagggtcat gacttgagcc taaaatccgc cgcctccacc atttgtagaa 1260 

aaatgtgacg aactcgtgag ctctgtacag tgaccggtga ctctttctgg catgcggaga 1320 

gacggacgga cgcagagaga agggctgagt aataagccac tggccagaca gctctggcgg 1380 

ctctgaggtg cagtggatga ttattaatcc gggaccggcc gcccctccgc cccgaagtgg 1440 

aaaggctggt gtgcccctcg ttgaccaaga atctattgca tcatcggaga atatggagct 1500 

tcatcgaatc accggcagta agcgaaggag aatgtgaagc caggggtgta tagccgtcgg 1560 

cgaaatagca tgccattaac ctaggtacag aagtccaatt gcttccgatc tggtaaaaga 1620 

ttcacgagat agtaccttct ccgaagtagg tagagcgagt acccggcgcg taagctccct 1680 

aattggccca tccggcatct gtagggcgtc caaatatcgt gcctctcctg ctttgcccgg 1740 

tgtatgaaac cggaaaggcc gctcaggagc tggccagcgg cgcagaccgg gaacacaagc 1800 

tggcagtcga cccatccggt gctctgcact cgacctgctg aggtccctca gtccctggta 1860 

ggcagctttg ccccgtctgt ccgcccggtg tgtcggcggg gttgacaagg tcgttgcgtc 1920 

agtccaacat ttgttgccat attttcctgc tctccccacc agctgctctt ttcttttctc 1980 

tttcttttcc catcttcagt atattcatct tcccatccaa gaacctttat ttcccctaag 2040 
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taagtacttt gctacatcca tactccatcc ttcccatccc ttattccttt gaacctttca 2100 

gttcgagctt tcccacttca tcgcagcttg actaacagct accccgcttg agcagacatc 2160 

accatgcctg aactcaccgc gacgtctgtc gagaagtttc tgatcgaaaa gttcgacagc 2220 

gtctccgacc tgatgcagct ctcggagggc gaagaatctc gtgctttcag cttcgatgta 2280 

ggagggcgtg gatatgtcct gcgggtaaat agctgcgccg atggtttcta caaagatcgt 2340 

tatgtttatc ggcactttgc atcggccgcg ctcccgattc cggaagtgct tgacattggg 2400 

gaattcagcg agagcctgac ctattgcatc tcccgccgtg cacagggtgt cacgttgcaa 24 60 

gacctgcctg aaaccgaact gcccgctgtt ctgcagccgg tcgcggaggc catggatgcg 2520 

atcgctgcgg ccgatcttag ccagacgagc gggttcggcc cattcggacc gcaaggaatc 2580 

ggtcaataca ctacatggcg tgatttcata tgcgcgattg ctgatcccca tgtgtatcac 2640 

tggcaaactg tgatggacga caccgtcagt gcgtccgtcg cgcaggctct cgatgagctg 2700 

atgctttggg ccgaggactg ccccgaagtc cggcacctcg tgcacgcgga tttcggctcc 2760 

aacaatgtcc tgacggacaa tggccgcata acagcggtca ttgactggag cgaggcgatg 2820 

ttcggggatt cccaatacga ggtcgccaac atcttcttct ggaggccgtg gttggcttgt 2880 

atggagcagc agacgcgcta cttcgagcgg aggcatccgg agcttgcagg atcgccgcgg 2940 

ctccgggcgt atatgctccg cattggtctt gaccaactct atcagagctt ggttgacggc 3000 

aatttcgatg atgcagcttg ggcgcagggt cgatgcgacg caatcgtccg atccggagcc 3060 

gggactgtcg ggcgtacaca aatcgcccgc agaagcgcgg ccgtctggac cgatggctgt 3120 

gtagaagtac tcgccgatag tggaaaccga cgccccagca ctcgtccgag ggcaaaggaa 3180 

tagagtagat gccgaccgcg ggatcgatcc acttaacgtt actgaaatca tcaaacagct 3240 
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tgacgaatct ggatataaga tcgttggtgt cgatgtcagc tccggagttg agacaaatgg 3300 

tgttcaggat ctcgataaga tacgttcatt tgtccaagca gcaaagagtg ccttctagtg 3360 

atttaatagc tccatgtcaa caagaataaa acgcgttttc gggtttacct cttccagata 3420 

cagctcatct gcaatgcatt aatgcattga ctgcaaccta gtaacgcctt ncaggctccg 3480 

gcgaagagaa gaatagctta gcagagctat tttcattttc gggagacgag atcaagcaga 3540 

tcaacggtcg tcaagagacc tacgagactg aggaatccgc tcttggctcc acgcgactat 3600 

atatttgtct ctaattgtac tttgacatgc tcctcttctt tactctgata gcttgactat 3660 

gaaaattccg tcaccagcnc ctgggttcgc aaagataatt gcatgtttct tccttgaact 3720 

ctcaagccta caggacacac attcatcgta ggtataaacc tcgaaatcan ttcctactaa 3780 

gatggtatac aatagtaacc atgcatggtt gcctagtgaa tgctccgtaa cacccaatac 3840 

gccggccgaa acttttttac aactctccta tgagtcgttt acccagaatg cacaggtaca 3900 

cttgtttaga ggtaatcctt ctttctagct agaagtcctc gtgtactgtg taagcgccca 3960 

ctccacatct ccactcgacc tgcaggcatg caaagcttga gattaaaata gataaggaaa 4020 

agaaagtgaa aagaaattcg gaagcatggc acattcttct ttttataaat acatgcctga 4080 

ctttcttttt ccatcgatat gatatatgca tatgatagat atacaagcaa tcttcttcaa 4140 

ggagtttgaa attttgtcct ccaggagcaa aaaaaagttt ttttttatac atgtttgtac 4200 

acaagaatag ttaccaattt gctttggtct tacgtgctgc aagtttatat cgttttcaat 4260 

ttctttgtct ttacattttc tttgtccttt atctttcctc atttagtctt tgggagaatt 4320 

aggaaaaggg agcggaaagg taagaaatgc ttgcgtattt tactaattcg gcaaacatcc 4 380 

aatttggcaa acagcagcct gtgcaacgct ctcgagatga cagtatcttt gattacactc 4440 

taaatctcga tgacccgacc aaaaagagcg aacaaagaaa taatcttgtg cattcgaata 4500 
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tgatggaaga ttttttcccc cttattctaa atgttgacat agcgtgtatg ttatataaac 4560 

aaaaagaaat tgtacaaact ttcttttctt ctctttttat tttatctcta tgctgtcgaa 4 620 

gctgcagtca atcagcgtca aggcccgccg cgttgaacta gcccgcgaca tcacgcggcc 4 680 

caaagtctgc ctgcatgctc agcggtgctc gttagttcgg ctgcgagtgg cagcaccaca 4740 

gacagaggag gcgctgggaa ccgtgcaggc tgccggcgcg ggcgatgagc acagcgccga 4800 

tgtagcactc cagcagcttg accgggctat cgcagagcgt cgtgcccggc gcaaacggga 4860 

gcagctgtca taccaggctg ccgccattgc agcatcaatt ggcgtgtcag gcattgccat 4 920 

cttcgccacc tacctgagat ttgccatgca catgaccgtg ggcggcgcag tgccatgggg 4 980 

tgaagtggct ggcactctcc tcttggtggt tggtggcgcg ctcggcatgg agatgtatgc 5040 

ccgctatgca cacaaagcca tctggcatga gtcgcctctg ggctggctgc tgcacaagag 5100 

ccaccacaca cctcgcactg gaccctttga agccaacgac ttgtttgcaa tcatcaatgg 5160 

actgcccgcc atgctcctgt gtacctttgg cttctggctg cccaacgtcc tgggggcggc 5220 

ctgctttgga gcggggctgg gcatcacgct atacggcatg gcatatatgt ttgtacacga 5280 

tggcctggtg cacaggcgct ttcccaccgg gcccatcgct ggcctgccct acatgaagcg 5340 

cctgacagtg gcccaccagc tacaccacag cggcaagtac ggtggcgcgc cctggggtat 5400 

gttcttgggt ccacaggagc tgcagcacat tccaggtgcg gcggaggagg tggagcgact 54 60 

ggtcctggaa ctggactggt ccaagcgggc gattgtgact gatagcgaga ctctgggtcg 5520 

atgttatctg cctcaacaat ggcttagaaa agaagaaaca gaacaaatac agcaaggcaa 5580 

cgcccgtagc ctaggtgatc aaagactgtt gggcttgtct ctgaagcttg taggaaaggc 5640 

agacgctatc atggtgagag ctaagaaggg cattgacaag ttgccggcaa actgtcaagg 5700 
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cggtgtacga gctgcttgcc aagtatatgc tgcaattgga tctgtactca agcagcagaa 5760 

gacaacatat cctacaagag ctcatctaaa aggaagcgaa cgtgccaaga ttgctctgtt 5820 

gagtgtatac aacctctatc aatctgaaga caagcctgtg gctctccgtc aagctagaaa 5880 

gattaagagt ttttttgttg attagtgaat ttttgtttta tttatgtctg atagttcaat 5940 

aaagagacaa cacatacaat ataaaatcat tgtctttaaa tgttaattta gtagagtgta 6000 

aagcctgcat tttttttgta cgcataaaca atgaattcac cccgcttctg gtttttaaat 6060 

aattatgtca aactagggaa aattcttttt tttctcttcg ttcttttttt ggcttgttgt 6120 

ggagtcacag gcttgtcttc agattgatag aggttgtata cactcaacag agcaatcttg 6180 

gcacgttcgc ttccttttag atgagctctt gtaggatatg ttgtcttctg ctgcttgagt 6240 

acagatccaa ttgcagcata tacttggcaa gcagctcgta caccgccttg acagtttgcc 6300 

ggcaacttgt caatgccctt cttagctctc accatgatag cgtctgcctt tcctacaagc 6360 

ttcagagaca agcccaacag tctttgatca cctaggctac gggcgttgcc ttgctgtatt 6420 

tgttctgttt cttcttttct aagccattgt tgaggcagat aacatcgacc caacatcctc 6480 

gagccatact acagcataaa aggatacgtt ttctttaaca gaaatttacc cttttgttat 6540 

cagcacatac aaaaaaaaag aaatttaaga tgagtaggac ttccattctc tcaaaaattt 6600 

tattcaatcc ataaatgaat tatttttgga caaaaaagaa agattatgcc tgattttctc 6660 

tatttttttt ttttttacaa ctccaccaat actttctagc ccagcttggc gtaatcatgg 6720 

tcatagctgt ttcctgtgtg aaattgttat ccgctcacaa ttccacacaa catacgagcc 6780 

ggaagcataa agtgtaaagc ctggggtgcc taatgagtga gctaactcac attaattgcg 6840 

ttgcgctcac tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc 6900 

ggccaacgcg cggggagagg cggtttgcgt attgggccaa agacaaaagg gcgacattca 6960 
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accgattgag ggagggaagg taaatattga cggaaattat tcattaaagg tgaattatca 7020 

ccgtcaccga cttgagccat ttgggaatta gagccagcaa aatcaccagt agcaccatta 7080 

ccattagcaa ggccggaaac gtcaccaatg aaaccatcga tagcagcacc gtaatcagta 7140 

gcgacagaat caagtttgcc tttagcgtca gactgtagcg cgttttcatc ggcattttcg 7200 

gtcatagccc ccttattagc gtttgccatc ttttcataat caaaatcacc ggaaccagag 7260 

ccaccaccgg aaccgcctcc ctcagagccg ccaccctcag aaccgccacc ctcagagcca 7320 

ccaccctcag agccgccacc agaaccacca ccagagccgc cgccagcatt gacaggaggc 7380 

ccgatctagt aacatagatg acaccgcgcg cgataattta tcctagtttg cgcgctatat 7440 

tttgttttct atcgcgtatt aaatgtataa ttgcgggact ctaatcataa aaacccatct 7500 

cataaataac gtcatgcatt acatgttaat tattacatgc ttaacgtaat tcaacagaaa 7560 

ttatatgata atcatcgcaa gaccggcaac aggattcaat cttaagaaac tttattgcca 7620 

aatgtttgaa cgatcgggga tcatccgggt ctgtggcggg aactccacga aaatatccga 7680 

acgcagcaag atatcgcggt gcatctcggt cttgcctggg cagtcgccgc cgacgccgtt 7740 

gatgtggacg ccgggcccga tcatattgtc gctcaggatc gtggcgttgt gcttgtcggc 7800 

cgttgctgtc gtaatgatat cggcaccttc gaccgcctgt tccgcagaga tcccgtgggc 7860 

gaagaactcc agcatgagat ccccgcgctg gaggatcatc cagccggcgt cccggaaaac 7920 

gattccgaag cccaaccttt catagaaggc ggcggtggaa tcgaaatctc gtgafggcag 7980 

gttgggcgtc gcttggtcgg tcatttcgaa ccccagagtc ccgctcagaa gaactcgtca 8040 

agaaggcgat agaaggcgat gcgctgcgaa tcgggagcgg cgataccgta aagcacgagg 8100 

aagcggtcag cccattcgcc gccaagctct tcagcaatat cacgggtagc caacgctatg 8160 
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tcctgatagc ggtccgccac acccagccgg ccacagtcga tgaatccaga aaagcggcca 8220 

ttttccacca tgatattcgg caagcaggca tcgccatggg tcacgacgag atcatcgccg 8280 

tcgggcatgc gcgccttgag cctggcgaac agttcggctg gcgcgagccc ctgatgctct 8340 

tcgtccagat catcctgatc gacaagaccg gcttccatcc gagtacgtgc tcgctcgatg 8400 

cgatgtttcg cttggtggtc gaatgggcag gtagccggat caagcgtatg cagccgccgc 8460 

attgcatcag ccatgatgga tactttctcg gcaggagcaa ggtgagatga caggagatcc 8520 

tgccccggca cttcgcccaa tagcagccag tcccttcccg cttcagtgac aacgtcgagc 8580 

acagctgcgc aaggaacgcc cgtcgtggcc agccacgata gccgcgctgc ctcgtcctgc 8640 

agttcattca gggcaccgga caggtcggtc ttgacaaaaa gaaccgggcg cccctgcgct 8700 

gacagccgga acacggcggc atcagagcag ccgattgtct gttgtgccca gtcatagccg 8760 

aatagcctct ccacccaagc ggccggagaa cctgcgtgca atccatcttg ttcaatcatg 8820 

cgaaacgatc cagatccggt gcagattatt tggattgaga gtgaatatga gactctaatt 8880 

ggataccgag gggaatttat ggaacgtcag tggagcattt ttgacaagaa atatttgcta 8940 

gctgatagtg accttaggcg acttttgaac gcgcaataat ggtttctgac gtatgtgctt 9000 

agctcattaa actccagaaa cccgcggctg agtggctcct tcaacgttgc ggttctgtca 9060 

gttccaaacg taaaacggct tgtcccgcgt catcggcggg ggtcataacg tgactccctt 9120 

aattctccgc tcatgatcag attgtcgttt cccgccttca gtttaaacta tcagtgtttg 9180 

acaggatata ttggcgggta aacctaagag aaaagagcgt ttattagaat aatcggatat 9240 

ttaaaagggc gtgaaaaggt ttatccgttc gtccatttgt atgtgcatgc caaccacagg 9300 

gttccccaga tctggcgccg gccagcgaga cgagcaagat tggccgccgc ccgaaacgat 9360 

ccgacagcgc gcccagcaca ggtgcgcagg caaattgcac caacgcatac agcgccagca 9420 
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gaatgccata gtgggcggtg acgtcgttcg agtgaaccag atcgcgcagg aggcccggca 9480 

gcaccggcat aatcaggccg atgccgacag cgtcgagcgc gacagtgctc agaattacga 9540 

tcaggggtat gttgggtttc acgtctggcc tccggaccag cctccgctgg tccgattgaa 9600 

cgcgcggatt ctttatcact gataagttgg tggacatatt atgtttatca gtgataaagt 9660 

gtcaagcatg acaaagttgc agccgaatac agtgatccgt gccgccctgg acctgttgaa 9720 

cgaggtcggc gtagacggtc tgacgacacg caaactggcg gaacggttgg gggttcagca 9780 

gccggcgctt tactggcact tcaggaacaa gcgggcgctg ctcgacgcac tggccgaagc 9840 

catgctggcg gagaatcata cgcattcggt gccgagagcc gacgacgact ggcgctcatt 9900 

tctgatcggg aatgcccgca gcttcaggca ggcgctgctc gcctaccgcg atggcgcgcg 9960 

catccatgcc ggcacgcgac cgggcgcacc gcagatggaa acggccgacg cgcagcttcg 10020 

cttcctctgc gaggcgggtt tttcggccgg ggacgccgtc aatgcgctga tgacaatcag 10080 

ctacttcact gttggggccg tgcttgagga gcaggccggc gacagcgatg ccggcgagcg 10140 

cggcggcacc gttgaacagg ctccgctctc gccgctgttg cgggccgcga tagacgcctt 10200 

cgacgaagcc ggtccggacg cagcgttcga gcagggactc gcggtgattg tcgatggatt 10260 

ggcgaaaagg aggctcgttg tcaggaacgt tgaaggaccg agaaagggtg acgattgatc 10320 

aggaccgctg ccggagcgca acccactcac tacagcagag ccatgtagac aacatcccct 10380 

ccccctttcc accgcgtcag acgcccgtag cagcccgcta cgggcttttt catgccctgc 10440 

cctagcgtcc aagcctcacg gccgcgctcg gcctctctgg cggccttctg gcgctcttcc 10500 

gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct 10560 

cactcaaagg cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg 10620 
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tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc 10680 

cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga 10740 

aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct 10800 

cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg 10860 

gcgcttttcc gctgcataac cctgcttcgg ggtcattata gcgatttttt cggtatatcc 10920 

atcctttttc gcacgatata caggattttg ccaaagggtt cgtgtagact ttccttggtg 10980 

tatccaacgg cgtcagccgg gcaggatagg tgaagtaggc ccacccgcga gcgggtgttc 11040 

cttcttcact gtcccttatt cgcacctggc ggtgctcaac gggaatcctg ctctgcgagg 11100 

ctggccggct accgccggcg taacagatga gggcaagcgg atggctgatg aaaccaagcc 11160 

aaccaggaag ggcagcccac ctatcaaggt gtactgcctt ccagacgaac gaagagcgat 11220 

tgaggaaaag gcggcggcgg ccggcatgag cctgtcggcc tacctgctgg ccgtcggcca 11280 

gggctacaaa atcacgggcg tcgtggacta tgagcacgtc cgcgagctgg cccgcatcaa 11340 

tggcgacctg ggccgcctgg gcggcctgct gaaactctgg ctcaccgacg acccgcgcac 11400 

ggcgcggttc ggtgatgcca cgatcctcgc cctgctggcg aagatcgaag agaagcagga 11460 

cgagcttggc aaggtcatga tgggcgtggt ccgcccgagg gcagagccat gactttttta 11520 

gccgctaaaa cggccggggg gtgcgcgtga ttgccaagca cgtccccatg cgctccatca 11580 

agaagagcga cttcgcggag ctggtgaagt acatcaccga cgagcaaggc aagaccgagc 11640 

gcctttgcga cgctcaccgg gctggttgcc ctcgccgctg ggctggcggc cgtctatggc 11700 

cctgcaaacg cgccagaaac gccgtcgaag ccgtgtgcga gacaccgcgg ccgccggcgt 11760 

tgtggatacc tcgcggaaaa cttggccctc actgacagat gaggggcgga cgttgacact 11820 

tgaggggccg actcacccgg cgcggcgttg acagatgagg ggcaggctcg atttcggccg 11880 
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gcgacgtgga gctggccagc ctcgcaaatc ggcgaaaacg cctgatttta cgcgagtttc 11940 

ccacagatga tgtggacaag cctggggata agtgccctgc ggtattgaca cttgaggggc 12000 

gcgactactg acagatgagg ggcgcgatcc ttgacacttg aggggcagag tgctgacaga 12060 

tgaggggcgc acctattgac atttgagggg ctgtccacag gcagaaaatc cagcatttgc 12120 

aagggtttcc gcccgttttt cggccaccgc taacctgtct tttaacctgc ttttaaacca 12180 

atatttataa accttgtttt taaccagggc tgcgccctgt gcgcgtgacc gcgcacgccg 12240 

aaggggggtg cccccccttc tcgaaccctc ccggcccgct aacgcgggcc tcccatcccc 12300 

ccaggggctg cgcccctcgg ccgcgaacgg cctcacccca aaaatggcag cgctggcagt 12360 

ccttgccatt gccgggatcg gggcagtaac gggatgggcg atcagcccga gcgcgacgcc 12420 

cggaagcatt gacgtgccgc aggtgctggc atcgacattc agcgaccagg tgccgggcag 12480 

tgagggcggc ggcctgggtg gcggcctgcc cttcacttcg gccgtcgggg cattcacgga 12540 

cttcatggcg gggccggcaa tttttacctt gggcattctt ggcatagtgg tcgcgggtgc 12600 

cgtgctcgtg ttcgggggtg cgataaaccc agcgaaccat ttgaggtgat aggtaagatt 12660 

ataccgaggt atgaaaacga gaattggacc tttacagaat tactctatga agcgccatat 12720 

ttaaaaagct accaagacga agaggatgaa gaggatgagg aggcagattg ccttgaatat 12780 

attgacaata ctgataagat aatatatctt ttatatagaa gatatcgccg tatgtaagga 12840 

tttcaggggg caaggcatag gcagcgcgct tatcaatata tctatagaat gggcaaagca 12900 

taaaaacttg catggactaa tgcttgaaac ccaggacaat aaccttatag cttgtaaatt 12960 

ctatcataat tgggtaatga ctccaactta ttgatagtgt tttatgttca gataatgccc 13020 

gatgactttg tcatgcagct ccaccgattt tgagaacgac agcgacttcc gtcccagccg 13080 
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tgccaggtgc tgcctcagat tcaggttatg ccgctcaatt cgctgcgtat atcgcttgct 13140 

gattacgtgc agctttccct tcaggcggga ttcatacagc ggccagccat ccgtcatcca 13200 

tatcaccacg tcaaagggtg acagcaggct cataagacgc cccagcgtcg ccatagtgcg 13260 

ttcaccgaat acgtgcgcaa caaccgtctt ccggagactg tcatacgcgt aaaacagcca 13320 

gcgctggcgc gatttagccc cgacatagcc ccactgttcg tccatttccg cgcagacgat 13380 

gacgtcactg cccggctgta tgcgcgaggt taccgactgc ggcctgagtt ttttaagtga 13440 

cgtaaaatcg tgttgaggcc aacgcccata atgcgggctg ttgcccggca tccaacgcca 13500 

ttcatggcca tatcaatgat tttctggtgc gtaccgggtt gagaagcggt gtaagtgaac 13560 

tgcagttgcc atgttttacg gcagtgagag cagagatagc gctgatgtcc ggcggtgctt 13620 

ttgccgttac gcaccacccc gtcagtagct gaacaggagg gacagctgat agacacagaa 13680 

gccactggag cacctcaaaa acaccatcat acactaaatc agtaagttgg cagcatcacc 13740 

cataattgtg gtttcaaaat cggctccgtc gatactatgt tatacgccaa ctttgaaaac 13800 

aactttgaaa aagctgtttrt ctggtattta aggttttaga atgcaaggaa cagtgaattg 13860 

gagttcgtct tgttataatt agcttcttgg ggtatcttta aatactgtag aaaagaggaa 13920 

ggaaataata aatggctaaa atgagaatat caccggaatt gaaaaaactg atcgaaaaat 13980 

accgctgcgt aaaagatacg gaaggaatgt ctcctgctaa ggtatataag ctggtgggag 14040 

aaaatgaaaa cctatattta aaaatgacgg acagccggta taaagggacc acctatgatg 14100 

tggaacggga aaaggacatg atgctatggc tggaaggaaa gctgcctgtt ' ccaaaggtcc 14160 

tgcactttga acggcatgat ggctggagca atctgctcat gagtgaggcc gatggcgtcc 14220 

tttgctcgga agagtatgaa gatgaacaaa gccctgaaaa gattatcgag ctgtatgcgg 14280 

agtgcatcag gctctttcac tccatcgaca tatcggattg tccctatacg aatagcttag 14340 
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acagccgctt agccgaattg gattacttac tgaataacga tctggccgat gtggattgcg 14400 

aaaactggga agaagacact ccatttaaag atccgcgcga gctgtatgat . tttttaaaga 144 60 

cggaaaagcc cgaagaggaa cttgtctttt cccacggcga cctgggagac agcaacatct 14520 

ttgtgaaaga tggcaaagta agtggcttta ttgatcttgg gagaagcggc agggcggaca 14580 

agtggtatga cattgccttc tgcgtccggt cgatcaggga ggatatcggg gaagaacagt 14640 

atgtcgagct attttttgac ttactgggga tcaagcctga ttgggagaaa ataaaatatt 14700 

atattttact ggatgaattg ttttagtacc tagatgtggc gcaacgatgc cggcgacaag 14760 

caggagcgca ccgacttctt ccgcatcaag tgttttggct ctcaggccga ggcccacggc 14820 

aagtatttgg gcaaggggtc gctggtattc gtgcagggca agattcggaa taccaagtac 14880 

gagaaggacg gccagacggt ctacgggacc gacttcattg ccgataaggt ggattatctg 14940 

gacaccaagg caccaggcgg gtcaaatcag gaataagggc acattgcccc ggcgtgagtc 15000 

ggggcaatcc cgcaaggagg gtgaatgaat cggacgtttg accggaaggc atacaggcaa 15060 

gaactgatcg acgcggggtt ttccgccgag gatgccgaaa ccatcgcaag ccgcaccgtc 15120 

atgcgtgcgc cccgcgaaac cttccagtcc gtcggctcga tggtccagca agctacggcc 15180 

aagatcgagc gcgacagcgt gcaactggct ccccctgccc tgcccgcgcc atcggccgcc 15240 

gtggagcgtt cgcgtcgtct cgaacaggag gcggcaggtt tggcgaagtc gatgaccatc 15300 

gacacgcgag gaactatgac gaccaagaag cgaaaaaccg ccggcgagga cctggcaaaa 15360 

caggtcagcg aggccaagca ggccgcgttg ctgaaacaca cgaagcagca gatcaaggaa 15420 

atgcagcttt ccttgttcga tattgcgccg tggccggaca cgatgcgagc gatgccaaac 15480 

gacacggccc gctctgccct gttcaccacg cgcaacaaga aaatcccgcg cgaggcgctg 15540 
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caaaacaagg tcattttcca cgtcaacaag gacgtgaaga tcacctacac cggcgtcgag 15600 

ctgcgggccg acgatgacga actggtgtgg cagcaggtgt tggagtacgc gaagcgcacc 15660 

cctatcggcg agccgatcac cttcacgttc tacgagcttt gccaggacct gggctggtcg 15720 

atcaatggcc ggtattacac gaaggccgag gaatgcctgt cgcgcctaca ggcgacggcg 15780 

atgggcttca cgtccgaccg cgttgggcac ctggaatcgg tgtcgctgct gcaccgcttc 15840 

cgcgtcctgg accgtggcaa gaaaacgtcc cgttgccagg tcctgatcga cgaggaaatc 15900 

gtcgtgctgt ttgctggcga ccactacacg aaattcatat gggagaagta ccgcaagctg 15960 

tcgccgacgg cccgacggat gttcgactat ttcagctcgc accgggagcc gtacccgctc 16020 

aagctggaaa ccttccgcct catgtgcgga tcggattcca cccgcgtgaa gaagtggcgc 16080 

gagcaggtcg gcgaagcctg cgaagagttg cgaggcagcg gcctggtgga acacgcctgg 1614 0 

gtcaatgatg acctggtgca ttgcaaacgc tagggccttg tggggtcagt tccggctggg 16200 

ggttcagcag ccagcgcttt actggcattt caggaacaag cgggcactgc tcgacgcact 16260 

tgcttcgctc agtatcgctc gggacgcacg gcgcgctcta cgaactgccg ataaacagag 16320 

gattaaaatt gacaattgtg attaaggctc agattcgacg gcttggagcg gccgacgtgc 16380 

aggatttccg cgagatccga ttgtcggccc tgaagaaagc tccagagatg ttcgggtccg 16440 

tttacgagca cgaggagaaa aagcccatgg aggcgttcgc tgaacggttg cgagatgccg 16500 

tggcattcgg cgcctacatc gacggcgaga tcattgggct gtcggtcttc aaacaggagg 16560 

acggccccaa ggacgctcac aaggcgcatc tgtccggcgt tttcgtggag cccgaacagc 16620 

gaggccgagg ggtcgccggt atgctgctgc gggcgttgcc ggcgggttta ttgctcgtga 16680 

tgatcgtccg acagattcca acgggaatct ggtggatgcg catcttcatc ctcggcgcac 16740 

ttaatatttc gctattctgg agcttgttgt ttatttcggt ctaccgcctg ccgggcgggg 16800 



BASF AG 

BASF NAE 877/03 



172/365 



January 08, 2004 



tcgcggcgac ggtaggcgct gtgcagccgc tgatggtcgt gttcatctct gccgctctgc 16860 

taggtagccc gatacgattg atggcggtcc tgggggctat ttgcggaact gcgggcgtgg 16920 

cgctgttggt gttgacacca aacgcagcgc tagatcctgt cggcgtcgca gcgggcctgg 16980 

cgggggcggt ttccatggcg ttcggaaccg tgctgacccg caagtggcaa cctcccgtgc 17040 

ctctgctcac ctttaccgcc tggcaactgg cggccggagg acttctgctc gttccagtag 17100 

ctttagtgtt tgatccgcca atcccgatgc ctacaggaac caatgttctc ggcctggcgt 17160 

ggctcggcct gatcggagcg ggtttaacct acttcctttg gttccggggg atctcgcgac 17220 

tcgaacctac agttgtttcc ttactgggct ttctcagccc cagatctggg gtcgatcagc 17280 

cggggatgca tcaggccgac agtcggaact tcgggtcccc gacctgtacc attcggtgag 17340 

caatggatag gggagttgat atcgtcaacg ttcacttcta aagaaatagc gccactcagc 17400 

ttcctcagcg gctttatcca gcgatttcct attatgtcgg catagttctc aagatcgaca 17460 

gcctgtcacg gttaagcgag aaatgaataa gaaggctgat aattcggatc tctgcgaggg 17520 

agatgatatt tgatcacagg cagcaacgct ctgtcatcgt tacaatcaac atgctaccct 17580 

ccgcgagatc atccgtgttt caaacccggc agcttagttg ccgttcttcc gaatagcatc 17640 

ggtaacatga gcaaagtctg ccgccttaca acggctctcc cgctgacgcc gtcccggact 17700 

gatgggctgc ctgtatcgag tggtgatttt gtgccgagct gccggtcggg gagctgttgg 17760 

ctggctggtg gcaggatata ttgtggtgta aacaaattga cgcttagaca acttaataac 17820 

acattgcgga cgtttttaat gtactggggt ggtttttctt ttcaccagtg agacgggcaa 17880 

cagctgattg cccttcaccg cctggccctg agagagttgc agcaagcggt ccacgctggt 17940 

ttgccccagc aggcgaaaat cctgtttgat ggtggttccg aaatcggcaa aatcccttat 18000 
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aaatcaaaag aatagcccga gatagggttg agtgttgttc cagtttggaa caagagtcca 18060 

ctattaaaga acgtggactc caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc 18120 

ccactacgtg aaccatcacc caaatcaagt tttttggggt cgaggtgccg taaagcacta 18180 

aatcggaacc ctaaagggag cccccgattt agagcttgac ggggaaagcc ggcgaacgtg 18240 

gcgagaaagg aagggaagaa agcgaaagga gcgggcgcca ttcaggctgc gcaactgttg 18300 

ggaagggcga tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc 18360 

tgcaaggcga ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac 18420 

ggccagtgaa ttcgagctcg gtacccggg 18449 

<210> 42 

<211> 17593 

<212> DNA 

<213> Artificial 

<220> 

<223> Plasmid 
<220> 

<221> misc_f eature 

<222> (10264) . . (10264) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (10472) . . (10472) 
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<220> 
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<400> 42 

ccgggctggt tgccctcgcc gctgggctgg cggccgtcta tggccctgca aacgcgccag 60 

aaacgccgtc gaagccgtgt gcgagacacc gcggccgccg gcgttgtgga tacctcgcgg 120 

aaaacttggc cctcactgac agatgagggg cggacgttga cacttgaggg gccgactcac 180 

ccggcgcggc gttgacagat gaggggcagg ctcgatttcg gccggcgacg tggagctggc 240 

cagcctcgca aatcggcgaa aacgcctgat tttacgcgag tttcccacag atgatgtgga 300 

caagcctggg gataagtgcc ctgcggtatt gacacttgag gggcgcgact actgacagat 360 

gaggg.gcgcg atccttgaca cttgaggggc agagtgctga cagatgaggg gcgcacctat 420 

tgacatttga ggggctgtcc acaggcagaa aatccagcat ttgcaagggt ttccgcccgt 480 

ttttcggcca ccgctaacct gtcttttaac ctgcttttaa accaatattt ataaaccttg 540 

tttttaacca gggctgcgcc ctgtgcgcgt gaccgcgcac gccgaagggg ggtgcccccc 600 

cttctcgaac cctcccggcc cgctaacgcg ggcctcccat ccccccaggg gctgcgcccc 660 

tcggccgcga acggcctcac cccaaaaatg gcagcgctgg cagtccttgc cattgccggg 720 

atcggggcag taacgggatg ggcgatcagc ccgagcgcga cgcccggaag cattgacgtg 780 

ccgcaggtgc tggcatcgac attcagcgac caggtgccgg gcagtgaggg cggcggcctg 840 

ggtggcggcc tgcccttcac ttcggccgtc ggggcattca cggacttcat ggcggggccg 900 

gcaattttta ccttgggcat tcttggcata gtggtcgcgg gtgccgtgct cgtgttcggg 960 

ggtgcgataa acccagcgaa ccatttgagg tgataggtaa gattataccg aggtatgaaa 1020 

acgagaattg gacctttaca gaattactct atgaagcgcc atatttaaaa agctaccaag 1080 

acgaagagga tgaagaggat gaggaggcag attgccttga atatattgac aatactgata 1140 

agataatata tcttttatat agaagatatc gccgtatgta aggatttcag ggggcaaggc 1200 
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ataggcagcg cgcttatcaa tatatctata gaatgggcaa agcataaaaa cttgcatgga 1260 

ctaatgcttg aaacccagga caataacctt atagcttgta aattctatca taattgggta 1320 

atgactccaa cttattgata gtgttttatg ttcagataat gcccgatgac tttgtcatgc 1380 

agctccaccg attttgagaa cgacagcgac ttccgtccca gccgtgccag gtgctgcctc 1440 

agattcaggt tatgccgctc aattcgctgc gtatatcgct tgctgattac gtgcagcttt 1500 

cccttcaggc gggattcata cagcggccag ccatccgtca tccatatcac cacgtcaaag 1560 

ggtgacagca ggctcataag acgccccagc gtcgccatag tgcgttcacc gaatacgtgc 1620 

gcaacaaccg tcttccggag actgtcatac gcgtaaaaca gccagcgctg gcgcgattta 1680 

gccccgacat agccccactg ttcgtccatt tccgcgcaga cgatgacgtc actgcccggc 1740 

tgtatgcgcg aggttaccga ctgcggcctg agttttttaa gtgacgtaaa atcgtgttga 1800 

ggccaacgcc cataatgcgg gctgttgccc ggcatccaac gccattcatg gccatatcaa 1860 

tgattttctg gtgcgtaccg ggttgagaag cggtgtaagt gaactgcagt tgccatgttt 1920 

tacggcagtg agagcagaga tagcgctgat gtccggcggt gcttttgccg ttacgcacca 1980 

ccccgtcagt agctgaacag gagggacagc tgatagacac agaagccact ggagcacctc 2040 

aaaaacacca tcatacacta aatcagtaag ttggcagcat cacccataat tgtggtttca 2100 

aaatcggctc cgtcgatact atgttatacg ccaactttga aaacaacttt gaaaaagctg 2160 

ttttctggta tttaaggttt tagaatgcaa ggaacagtga attggagttc gtcttgttat 2220 

aattagcttc ttggggtatc tttaaatact gtagaaaaga ggaaggaaat aataaatggc 2280 

taaaatgaga atatcaccgg aattgaaaaa actgatcgaa aaataccgct gcgtaaaaga 2340 

tacggaagga atgtctcctg ctaaggtata taagctggtg ggagaaaatg aaaacctata 2400 

tttaaaaatg acggacagcc ggtataaagg gaccacctat gatgtggaac gggaaaagga 24 60 
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catgatgcta tggctggaag gaaagctgcc tgttccaaag gtcctgcact ttgaacggca 2520 

tgatggctgg agcaatctgc tcatgagtga ggccgatggc gtcctttgct cggaagagta 2580 

tgaagatgaa caaagccctg aaaagattat cgagctgtat gcggagtgca tcaggctctt 2640 

tcactccatc gacatatcgg attgtcccta tacgaatagc ttagacagcc gcttagccga 2700 

attggattac ttactgaata acgatctggc cgatgtggat tgcgaaaact gggaagaaga 2760 

cactccattt aaagatccgc gcgagctgta tgatttttta aagacggaaa agcccgaaga 2820 

ggaacttgtc ttttcccacg gcgacctggg agacagcaac atctttgtga aagatggcaa 2880 

agtaagtggc tttattgatc ttgggagaag cggcagggcg gacaagtggt atgacattgc 2940 

cttctgcgtc cggtcgatca gggaggatat cggggaagaa cagtatgtcg agctattttt 3000 

tgacttactg gggatcaagc ctgattggga gaaaataaaa tattatattt tactggatga 3060 

attgttttag tacctagatg tggcgcaacg atgccggcga caagcaggag cgcaccgact 3120 

tcttccgcat caagtgtttt ggctctcagg ccgaggccca cggcaagtat ttgggcaagg 3180 

ggtcgctggt attcgtgcag ggcaagattc ggaataccaa gtacgagaag gacggccaga 3240 

cggtctacgg gaccgacttc attgccgata aggtggatta tctggacacc aaggcaccag 3300 

t 

gcgggtcaaa tcaggaataa gggcacattg ccccggcgtg agtcggggca atcccgcaag 3360 

gagggtgaat gaatcggacg tttgaccgga aggcatacag gcaagaactg atcgacgcgg 3420 

ggttttccgc cgaggatgcc gaaaccatcg caagccgcac cgtcatgcgt gcgccccgcg 3480 

aaaccttcca gtccgtcggc tcgatggtcc agcaagctac ggccaagatc gagcgcgaca 3540 

gcgtgcaact ggctccccct gccctgcccg cgccatcggc cgccgtggag cgttcgcgtc 3600 

gtctcgaaca ggaggcggca ggtttggcga agtcgatgac catcgacacg cgaggaacta 3660 
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tgacgaccaa gaagcgaaaa accgccggcg aggacctggc aaaacaggtc agcgaggcca 3720 

agcaggccgc gttgctgaaa cacacgaagc agcagatcaa ggaaatgcag ctttccttgt 3780 

tcgatattgc gccgtggccg gacacgatgc gagcgatgcc aaacgacacg gcccgctctg 3840 

ccctgttcac cacgcgcaac aagaaaatcc cgcgcgaggc gctgcaaaac aaggtcattt 3900 

tccacgtcaa caaggacgtg aagatcacct acaccggcgt cgagctgcgg gccgacgatg 3960 

acgaactggt gtggcagcag gtgttggagt acgcgaagcg cacccctatc ggcgagccga 4020 

tcaccttcac gttctacgag ctttgccagg acctgggctg gtcgatcaat ggccggtatt 4080 

acacgaaggc cgaggaatgc ctgtcgcgcc tacaggcgac ggcgatgggc ttcacgtccg 4140 

accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc ctggaccgtg 4200 

gcaagaaaac gtcccgttgc caggtcctga tcgacgagga aatcgtcgtg ctgtttgctg 4260 

gcgaccacta cacgaaattc atatgggaga agtaccgcaa gctgtcgccg acggcccgac 4320 

ggatgttcga ctatttcagc tcgcaccggg agccgtaccc gctcaagctg gaaaccttcc 4380 

gcctcatgtg cggatcggat tccacccgcg tgaagaagtg gcgcgagcag gtcggcgaag 4440 

cctgcgaaga gttgcgaggc agcggcctgg tggaacacgc ctgggtcaat gatgacctgg 4500 

tgcattgcaa acgctagggc cttgtggggt cagttccggc tgggggttca gcagccagcg 4560 

ctttactggc atttcaggaa caagcgggca ctgctcgacg cacttgcttc gctcagtatc 4 620 

gctcgggacg cacggcgcgc tctacgaact gccgataaac agaggattaa aattgacaat 4 680 

tgtgattaag gctcagattc gacggcttgg agcggccgac gtgcaggatt tccgcgagat 4740 

ccgattgtcg gccctgaaga aagctccaga gatgttcggg tccgtttacg agcacgagga 4800 

gaaaaagccc atggaggcgt tcgctgaacg gttgcgagat gccgtggcat tcggcgccta 4860 

catcgacggc gagatcattg ggctgtcggt cttcaaacag gaggacggcc ccaaggacgc 4 920 
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tcacaaggcg catctgtccg gcgttttcgt ggagcccgaa cagcgaggcc gaggggtcgc 4 980 

cggtatgctg ctgcgggcgt tgccggcggg tttattgctc gtgatgatcg tccgacagat 5040 

tccaacggga atctggtgga tgcgcatctt catcctcggc gcacttaata tttcgctatt 5100 

ctggagcttg ttgtttattt cggtctaccg cctgccgggc ggggtcgcgg cgacggtagg 5160 

cgctgtgcag ccgctgatgg tcgtgttcat ctctgccgct ctgctaggta gcccgatacg 5220 

attgatggcg gtcctggggg ctatttgcgg aactgcgggc gtggcgctgt tggtgttgac 5280 

accaaacgca gcgctagatc ctgtcggcgt cgcagcgggc ctggcggggg cggtttccat 5340 

ggcgttcgga accgtgctga cccgcaagtg gcaacctccc gtgcctctgc tcacctttac 5400 

cgcctggcaa ctggcggccg gaggacttct gctcgttcca gtagctttag tgtttgatcc 54 60 

gccaatcccg atgcctacag gaaccaatgt tctcggcctg gcgtggctcg gcctgatcgg 5520 

agcgggttta acctacttcc tttggttccg ggggatctcg cgactcgaac ctacagttgt 5580 

ttccttactg ggctttctca gccccagatc tggggtcgat cagccgggga tgcatcaggc 5640 

cgacagtcgg aacttcgggt ccccgacctg taccattcgg tgagcaatgg ataggggagt 5700 

tgatatcgtc aacgttcact tctaaagaaa tagcgccact cagcttcctc agcggcttta 57.60 

tccagcgatt tcctattatg tcggcatagt tctcaagatc gacagcctgt cacggttaag 5820 

cgagaaatga ataagaaggc tgataattcg gatctctgcg agggagatga tatttgatca 5880 

caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga gatcatccgt 5940 

gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac atgagcaaag 6000 

tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg ctgcctgtat 6060 

cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct ggtggcagga 6120 
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tatattgtgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg cggacgtttt 6180 

taatgtactg gggtggtttt tcttttcacc agtgagacgg gcaacagctg attgcccttc 6240 

accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtttgccc cagcaggcga 6300 

aaatcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca aaagaatagc 6360 

ccgagatagg gttgagtgtt gttccagttt ggaacaagag tccactatta aagaacgtgg 6420 

actccaacgt caaagggcga aaaaccgtct atcagggcga tggcccacta cgtgaaccat 6480 

cacccaaatc aagttttttg gggtcgaggt gccgtaaagc actaaatcgg aaccctaaag 6540 

ggagcccccg atttagagct tgacggggaa agccggcgaa cgtggcgaga aaggaaggga 6600 

agaaagcgaa aggagcgggc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg 6660 

cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt 6720 

tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattcgag 6780 

ctcggtaccc ggggatcttt cgacactgaa atacgtcgag cctgctccgc ttggaagcgg 6840 

cgaggagcct cgtcctgtca caactaccaa catggagtac gataagggcc agttccgcca 6900 

gctcattaag agccagttca tgggcgttgg catgatggcc gtcatgcatc tgtacttcaa 6960 

gtacaccaac gctcttctga tccagtcgat catccgctga aggcgctttc gaatctggtt 7020 

aagatccacg tcttcgggaa gccagcgact ggtgacctcc agcgtccctt taaggctgcc 7080 

aacagctttc tcagccaggg ccagcccaag accgacaagg cctccctcca gaacgccgag 7140 

aagaactgga ggggtggtgt caaggaggag taagctcctt attgaagtcg gaggacggag 7200 

cggtgtcaag aggatattct tcgactctgt attatagata agatgatgag gaattggagg 7260 

tagcatagct tcatttggat ttgctttcca ggctgagact ctagcttgga gcatagaggg 7320 

tcctttggct ttcaatattc tcaagtatct cgagtttgaa cttattccct gtgaaccttt 7380 
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tattcaccaa tgagcattgg aatgaacatg aatctgagga ctgcaatcgc catgaggttt 7440 

tcgaaataca tccggatgtc gaaggcttgg ggcacctgcg ttggttgaat ttagaacgtg 7500 

gcactattga tcatccgata gctctgcaaa gggcgttgca caatgcaagt caaacgttgc 7560 

tagcagttcc aggtggaatg ttatgatgag cattgtatta aatcaggaga tatagcatga 7620 

tctctagtta gctcaccaca aaagtcagac ggcgtaacca aaagtcacac aacacaagct 7680 

gtaaggattt cggcacggct acggaagacg gagaagccac cttcagtgga ctcgagtacc 7740 

atttaattct atttgtgttt gatcgagacc taatacagcc cctacaacga ccatcaaagt 7800 

cgtatagcta ccagtgagga agtggactca aatcgacttc agcaacatct cctggataaa 7860 

ctttaagcct aaactataca gaataagata ggtggagagc ttataccgag ctcccaaatc 7920 

tgtccagatc atggttgacc ggtgcctgga tcttcctata gaatcatcct tattcgttga 7980 

cctagctgat tctggagtga cccagagggt catgacttga gcctaaaatc cgccgcctcc 8040 

accatttgta gaaaaatgtg acgaactcgt gagctctgta cagtgaccgg tgactctttc 8100 

tggcatgcgg agagacggac ggacgcagag agaagggctg agtaataagc cactggccag 8160 

acagctctgg cggctctgag gtgcagtgga tgattattaa tccgggaccg gccgcccctc 8220 

cgccccgaag tggaaaggct ggtgtgcccc tcgttgacca agaatctatt gcatcatcgg 8280 

agaatatgga gcttcatcga atcaccggca gtaagcgaag gagaatgtga agccaggggt 8340 

gtatagccgt cggcgaaata gcatgccatt aacctaggta cagaagtcca attgcttccg 8400 

atctggtaaa agattcacga gatagtacct tctccgaagt aggtagagcg agtacccggc 84 60 

gcgtaagctc cctaattggc ccatccggca tctgtagggc gtccaaatat cgtgcctctc 8520 

ctgctttgcc cggtgtatga aaccggaaag gccgctcagg agctggccag cggcgcagac 8580 
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cgggaacaca agctggcagt cgacccatcc ggtgctctgc actcgacctg ctgaggtccc 8640 

tcagtccctg gtaggcagct ttgccccgtc tgtccgcccg gtgtgtcggc ggggttgaca 8700 

aggtcgttgc gtcagtccaa catttgttgc catattttcc tgctctcccc accagctgct 8760 

cttttctttt ctctttcttt tcccatcttc agtatattca tcttcccatc caagaacctt 8820 

tatttcccct aagtaagtac tttgctacat ccatactcca tccttcccat cccttattcc 8880 

tttgaacctt tcagttcgag ctttcccact tcatcgcagc ttgactaaca gctaccccgc 8940 

ttgagcagac atcaccatgc ctgaactcac cgcgacgtct gtcgagaagt ttctgatcga 9000 

aaagttcgac agcgtctccg acctgatgca gctctcggag ggcgaagaat ctcgtgcttt 9060 

cagcttcgat gtaggagggc gtggatatgt cctgcgggta aatagctgcg ccgatggttt 9120 

ctacaaagat cgttatgttt atcggcactt tgcatcggcc gcgctcccga ttccggaagt 9180 

gcttgacatt ggggaattca gcgagagcct gacctattgc atctcccgcc gtgcacaggg 9240 

tgtcacgttg caagacctgc ctgaaaccga actgcccgct gttctgcagc cggtcgcgga 9300 

ggccatggat gcgatcgctrg cggccgatct tagccagacg agcgggttcg gcccattcgg 9360 

accgcaagga atcggtcaat acactacatg gcgtgatttc atatgcgcga ttgctgatcc 9420 

ccatgtgtat cactggcaaa ctgtgatgga cgacaccgtc agtgcgtccg tcgcgcaggc 9480 

tctcgatgag ctgatgcttt gggccgagga ctgccccgaa gtccggcacc tcgtgcacgc 9540 

ggatttcggc tccaacaatg tcctgacgga caatggccgc ataacagcgg tcattgactg 9600 

gagcgaggcg atgttcgggg attcccaata cgaggtcgcc aacatcttct tctggaggcc 9660 

gtggttggct tgtatggagc agcagacgcg ctacttcgag cggaggcatc cggagcttgc 9720 

aggatcgccg cggctccggg cgtatatgct ccgcattggt cttgaccaac tctatcagag 9780 

cttggttgac ggcaatttcg atgatgcagc ttgggcgcag ggtcgatgcg acgcaatcgt 9840 
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ccgatccgga gccgggactg tcgggcgtac acaaatcgcc cgcagaagcg cggccgtctg 9900 

gaccgatggc tgtgtagaag tactcgccga tagtggaaac cgacgcccca gcactcgtcc 9960 

gagggcaaag gaatagagta gatgccgacc gcgggatcga tccacttaac gttactgaaa 10020 

tcatcaaaca gcttgacgaa tctggatata agatcgttgg tgtcgatgtc agctccggag 10080 

ttgagacaaa tggtgttcag gatctcgata agatacgttc atttgtccaa gcagcaaaga 10140 

gtgccttcta gtgatttaat agctccatgt caacaagaat aaaacgcgtt ttcgggttta 10200 

cctcttccag atacagctca tctgcaatgc attaatgcat tgactgcaac ctagtaacgc 10260 

cttncaggct ccggcgaaga gaagaatagc ttagcagagc tattttcatt ttcgggagac 10320 

gagatcaagc agatcaacgg tcgtcaagag acctacgaga ctgaggaatc cgctcttggc 10380 

tccacgcgac tatatatttg tctctaattg tactttgaca tgctcctctt ctttactctg 10440 

atagcttgac tatgaaaatt ccgtcaccag cncctgggtt cgcaaagata attgcatgtt 10500 

tcttccttga actctcaagc ctacaggaca cacattcatc gtaggtataa acctcgaaat 10560 

canttcctac taagatggta tacaatagta accatgcatg gttgcctagt gaatgctccg 10620 

taacacccaa tacgccggcc gaaacttttt tacaactctc ctatgagtcg tttacccaga 10680 

atgcacaggt acacttgttt agaggtaatc cttctttcta gctagaagtc ctcgtgtact 10740 

gtgtaagcgc ccactccaca tctccactcg acctgcaggc atgcaagctt ttttcgagtt 10800 

tttttttttt ttctttgtga aggatttatt gttattggta tccatttttt attggaagac 10860 

aagataagtt aatattgatt ttgcttaaag attaaaagga aatcagaaaa cgacaataaa 10920 

aaatgtaacg gacaaactat ggtgtcgatt ataagtctaa atccttaaaa aatgacaacg 10980 

agttgctttc ctctgaaaac aattcttttg tctttgcaag aaaggtttct tttttgtttg 11040 
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cttgcattac ttaaacatca aatcaaatga aaggaataaa gcagatttga gggcgaataa 11100 

ggattttctg gtcaacaaga tgtgagtgac acctaaggaa ctaaatgcca ttcatttgtt 11160 

ttaaaacgac atcaaagatt gatgatcaac aggattgaga gagagaaaaa gaactcgtgt 11220 

catttatttc tgttgactga aattttatat ttagaaaaaa tgtcaaatct atagctttag 11280 

ctatattaca taacatttga aataataata ataaaaaaag acacattaga gacacttttc 11340 

aaactctaaa taactgtcta taaacacaaa gaaaacaaag acctctataa caacttatta 11400 

gatttttctc gtacttttgt ctaaagatga tgtattcttg ttatcccaca cttctttcat 11460 

ttgttcttga tgctactaaa tatacaaaat ttcttttttg caagagatat tattccaaaa 11520 

attttcaaaa agaaattttt ttcacaatag cagttgatcg tgtaacccaa agaggttctt 11580 

tgttattttg cacttccgct ttgcggtgat gcatattcaa agtaatatat ggaataaaca 11640 

acgtgtttaa gcatgaaaga aaggaaacaa aggccgcttt gaacaaatgc ataatatttc 11700 

agacaaaaat gatctaaagc aagcagtaaa tcaaacaaga aacattgctg attcgcgtta 11760 

gaaaacgata aaagtctaat aagccactaa gtatacttca atgaactttt tgtatgctta 11820 

tggtccaatc agaccaataa tttgtgacca ttcctgaggt ggctttggtg atgcggaaac 11880 

agaaaaaaat tttctcacca atcgatttaa aaaacaattt ctgctttgaa ccaaaacttt 11940 

ttttttctct ttaatcatta actttatcaa gtatgtacct accctcaaag tcctcactca 12000 

agcacaatta tgctaacatt gttccacctt ctctttagaa atgttgtgga tttggaatgc 12060 

cctgatcgtt ttcgttaccg tgattggcat ggaagtgatt gctgcactgg cacacaaata 12120 

catcatgcac ggctggggtt ggggatggca tctttcacat catgaaccgc gtaaaggtgc 12180 

gtttgaagtt aacgatcttt atgccgtggt ttttgctgca ttatcgatcc tgctgattta 12240 

tctgggcagt acaggaatgt ggccgctcca gtggattggc gcaggtatga cggcgtatgg 12300 
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attactctat tttatggtgc acgacgggct ggtgcatcaa cgttggccat tccgctatat 12360 

tccacgcaag ggctacctca aacggttgta tatggcgcac cgtatgcatc acgccgtcag 12420 

gggcaaagaa ggttgtgttt cttttggctt cctctatgcg ccgcccctgt caaaacttca 12480 

ggcgacgctc cgggaaagac atggcgctag agcgggcgct gccagagatg cgcagggcgg 12540 

ggaggatgag cccgcatccg ggaagtaagg gcctgaccag aggcggccag cagcagcgtt 12600 

aatttttcgg gcgtggtcgt tgactgccgc tgatcccaaa gcttggcgta atcatggtca 12660 

tagctgtttc ctgtgtgaaa ttgttatccg ctcacaattc cacacaacat acgagccgga 12720 

agcataaagt gtaaagcctg gggtgcctaa tgagtgagct aactcacatt aattgcgttg 12780 

cgctcactgc ccgctttcca gtcgggaaac ctgtcgtgcc agctgcatta atgaatcggc 12840 

caacgcgcgg ggagaggcgg tttgcgtatt gggccaaaga caaaagggcg acattcaacc 12900 

gattgaggga gggaaggtaa atattgacgg aaattattca ttaaaggtga attatcaccg 12960 

tcaccgactt gagccatttg ggaattagag ccagcaaaat caccagtagc accattacca 13020 

ttagcaaggc cggaaacgtc accaatgaaa ccatcgatag cagcaccgta atcagtagcg 13080 

acagaatcaa gtttgccttt agcgtcagac tgtagcgcgt tttcatcggc attttcggtc 13140 

atagccccct tattagcgtt tgccatcttt tcataatcaa aatcaccgga accagagcca 13200 

ccaccggaac cgcctccctc agagccgcca ccctcagaac cgccaccctc agagccacca 13260 

ccctcagagc cgccaccaga accaccacca gagccgccgc cagcattgac aggaggcccg 13320 

atctagtaac atagatgaca ccgcgcgcga taatttatcc tagtttgcgc gctatatttt 13380 

gttttctatc gcgtattaaa tgtataattg cgggactcta atcataaaaa cccatctcat 13440 

aaataacgtc atgcattaca tgttaattat tacatgctta acgtaattca acagaaatta 13500 
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tatgataatc atcgcaagac cggcaacagg attcaatctt aagaaacttt attgccaaat 13560 

gtttgaacga tcggggatca tccgggtctg tggcgggaac tccacgaaaa tatccgaacg 13620 

cagcaagata tcgcggtgca tctcggtctt gcctgggcag tcgccgccga cgccgttgat 13680 

gtggacgccg ggcccgatca tattgtcgct caggatcgtg gcgttgtgct tgtcggccgt 13740 

tgctgtcgta atgatatcgg caccttcgac cgcctgttcc gcagagatcc cgtgggcgaa 13800 

gaactccagc atgagatccc cgcgctggag gatcatccag ccggcgtccc ggaaaacgat 13860 

tccgaagccc aacctttcat agaaggcggc ggtggaatcg aaatctcgtg atggcaggtt 13S20 

gggcgtcgct tggtcggtca tttcgaaccc cagagtcccg ctcagaagaa ctcgtcaaga 13980 

aggcgataga aggcgatgcg ctgcgaatcg ggagcggcga taccgtaaag cacgaggaag 14040 

cggtcagccc attcgccgcc aagctcttca gcaatatcac gggtagccaa cgctatgtcc 14100 

tgatagcggt ccgccacacc cagccggcca cagtcgatga atccagaaaa gcggccattt 14160 

tccaccatga tattcggcaa gcaggcatcg ccatgggtca cgacgagatc atcgccgtcg 14220 

ggcatgcgcg ccttgagcct ggcgaacagt tcggctggcg cgagcccctg atgctcttcg 14280 

tccagatcat cctgatcgac aagaccggct tccatccgag tacgtgctcg ctcgatgcga 14340 

tgtttcgctt ggtggtcgaa tgggcaggta gccggatcaa gcgtatgcag ccgccgcatt 14400 

gcatcagcca tgatggatac tttctcggca ggagcaaggt gagatgacag gagatcctgc 14460 

cccggcactt cgcccaatag cagccagtcc cttcccgctt cagtgacaac gtcgagcaca 14520 

gctgcgcaag gaacgcccgt cgtggccagc cacgatagcc gcgctgcctc gtcctgcagt 14580 

tcattcaggg caccggacag gtcggtcttg acaaaaagaa ccgggcgccc ctgcgctgac 14640 

agccggaaca cggcggcatc agagcagccg attgtctgtt gtgcccagtc atagccgaat 14700 

agcctctcca cccaagcggc cggagaacct gcgtgcaatc catcttgttc aatcatgcga 14760 
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aacgatccag atccggtgca gattatttgg attgagagtg aatatgagac tctaattgga 14820 

taccgagggg aatttatgga acgtcagtgg agcatttttg acaagaaata tttgctagct 14880 

gatagtgacc ttaggcgact tttgaacgcg caataatggt ttctgacgta tgtgcttagc 14940 

tcattaaact ccagaaaccc gcggctgagt ggctccttca acgttgcggt tctgtcagtt 15000 

ccaaacgtaa aacggcttgt cccgcgtcat cggcgggggt cataacgtga ctcccttaat 15060 

tctccgctca tgatcagatt gtcgtttccc gccttcagtt taaactatca gtgtttgaca 15120 

ggatatattg gcgggtaaac ctaagagaaa agagcgttta ttagaataat cggatattta 15180 

aaagggcgtg aaaaggttta tccgttcgtc catttgtatg tgcatgccaa ccacagggtt 15240 

ccccagatct ggcgccggcc agcgaqacga gcaagattgg ccgccgcccg aaacgatccg 15300 

acagcgcgcc cagcacaggt gcgcaggcaa attgcaccaa cgcatacagc gccagcagaa 15360 

tgccatagtg ggcggtgacg tcgttcgagt gaaccagatc gcgcaggagg cccggcagca 15420 

ccggcataat caggccgatg ccgacagcgt cgagcgcgac agtgctcaga attacgatca 15480 

ggggtatgtt gggtttcacg tctggcctcc ggaccagcct ccgctggtcc gattgaacgc 15540 

gcggattctt tatcactgat aagttggtgg acatattatg tttatcagtg ataaagtgtc 15600 

aagcatgaca aagttgcagc cgaatacagt gatccgtgcc gccctggacc tgttgaacga 15660 

ggtcggcgta gacggtctga cgacacgcaa actggcggaa cggttggggg ttcagcagcc 15720 

ggcgctttac tggcacttca ggaacaagcg ggcgctgctc gacgcactgg ccgaagccat 15780 

gctggcggag aatcatacgc attcggtgcc gagagccgac gacgactggc gctcatttct 15840 

gatcgggaat gcccgcagct tcaggcaggc gctgctcgcc taccgcgatg gcgcgcgcat 15900 

ccatgccggc acgcgaccgg gcgcaccgca gatggaaacg gccgacgcgc agcttcgctt 15960 
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cctctgcgag gcgggttttt cggccgggga cgccgtcaat gcgctgatga caatcagcta 16020 

cttcactgtt ggggccgtgc ttgaggagca ggccggcgac agcgatgccg gcgagcgcgg 16080 

cggcaccgtt gaacaggctc cgctctcgcc gctgttgcgg gccgcgatag acgccttcga 16140 

cgaagccggt ccggacgcag cgttcgagca gggactcgcg gtgattgtcg atggattggc 16200 

gaaaaggagg ctcgttgtca ggaacgttga aggaccgaga aagggtgacg attgatcagg 16260 

accgctgccg gagcgcaacc cactcactac agcagagcca tgtagacaac atcccctccc 16320 

cctttccacc gcgtcagacg cccgtagcag cccgctacgg gctttttcat gccctgccct 16380 

agcgtccaag cctcacggcc gcgctcggcc tctctggcgg ccttctggcg ctcttccgct 16440 

tcctcgctca ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac 16500 

tcaaaggcgg taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga 16560 

gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat 16620 

aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac 16680 

ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct 16740 

gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg 16800 

cttttccgct gcataaccct gcttcggggt cattatagcg attttttcgg tatatccatc 16860 

ctttttcgca cgatatacag gattttgcca aagggttcgt gtagactttc cttggtgtat 16920 

ccaacggcgt cagccgggca ggataggtga agtaggccca cccgcgagcg ggtgttcctt 16980 

cttcactgtc ccttattcgc acctggcggt gctcaacggg aatcctgctc tgcgaggctg 17040 

gccggctacc gccggcgtaa cagatgaggg caagcggatg gctgatgaaa ccaagccaac 17100 

caggaagggc agcccaccta tcaaggtgta ctgccttcca gacgaacgaa gagcgattga 17160 

ggaaaaggcg gcggcggccg gcatgagcct gtcggcctac ctgctggccg tcggccaggg 17220 
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ctacaaaatc acgggcgtcg tggactatga gcacgtccgc gagctggccc gcatcaatgg 17280 

cgacctgggc cgcctgggcg gcctgctgaa actctggctc accgacgacc cgcgcacggc 17340 

gcggttcggt gatgccacga tcctcgccct gctggcgaag atcgaagaga agcaggacga 17400 

gcttggcaag gtcatgatgg gcgtggtccg cccgagggca gagccatgac ttttttagcc 17460 

gctaaaacgg ccggggggtg cgcgtgattg ccaagcacgt ccccatgcgc tccatcaaga 17520 

agagcgactt cgcggagctg gtgaagtaca tcaccgacga gcaaggcaag accgagcgcc 17580 

tttgcgacgc tea 17593 
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ccgggctggt tgccctcgcc gctgggctgg 
aaacgccgtc gaagccgtgt gcgagacacc 
aaaacttggc cctcactgac agatgagggg 
ccggcgcggc gttgacagat gaggggcagg 
cagcctcgca aatcggcgaa aacgcctgat 
caagcctggg gataagtgcc ctgcggtatt 
gaggggcgcg atccttgaca cttgaggggc 
tgacatttga ggggctgtcc acaggcagaa 
ttttcggcca ccgctaacct gtcttttaac 
tttttaacca gggctgcgcc ctgtgcgcgt 
cttctcgaac cctcccggcc cgctaacgcg 
tcggccgcga acggcctcac cccaaaaatg 
atcggggcag taacgggatg ggcgatcagc 
ccgcaggtgc tggcatcgac attcagcgac 
ggtggcggcc tgcccttcac ttcggccgtc 
gcaattttta ccttgggcat tcttggcata 
ggtgcgataa acccagcgaa ccatttgagg 
acgagaattg gacctttaca gaattactct 
acgaagagga tgaagaggat gaggaggcag 
agataatata tcttttatat agaagatatc 
ataggcagcg cgcttatcaa tatatctata 
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cggccgtcta tggccctgca aacgcgccag 60 

gcggccgccg gcgttgtgga tacctcgcgg 120 

cggacgttga cacttgaggg gccgactcac 180 

ctcgatttcg gccggcgacg tggagctggc 240 

tttacgcgag tttcccacag atgatgtgga 300 

gacacttgag gggcgcgact actgacagat 360 

agagtgctga cagatgaggg gcgcacctat 420 

aatccagcat ttgcaagggt ttccgcccgt 480 

ctgcttttaa accaatattt ataaaccttg 540 

gaccgcgcac gccgaagggg ggtgcccccc 600 

ggcctcccat ccccccaggg gctgcgcccc 660 

gcagcgctgg cagtccttgc cattgccggg 720 

ccgagcgcga cgcccggaag cattgacgtg 780 

caggtgccgg gcagtgaggg cggcggcctg 840 

ggggcattca cggacttcat ggcggggccg 900 

gtggtcgcgg gtgccgtgct cgtgttcggg 960 

tgataggtaa gattataccg aggtatgaaa 1020 

atgaagcgcc atatttaaaa agctaccaag 1080 

attgccttga atatattgac aatactgata 1140 

gccgtatgta aggatttcag ggggcaaggc 1200 

gaatgggcaa agcataaaaa cttgcatgga 1260 
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ctaatgcttg aaacccagga caataacctt atagcttgta aattctatca taattgggta 1320 

atgactccaa cttattgata gtgttttatg ttcagataat gcccgatgac tttgtcatgc 1380 

agctccaccg attttgagaa cgacagcgac ttccgtccca gccgtgccag gtgctgcctc 1440 

agattcaggt tatgccgctc aattcgctgc gtatatcgct tgctgattac gtgcagcttt 1500 

cccttcaggc gggattcata cagcggccag ccatccgtca tccatatcac cacgtcaaag 1560 

ggtgacagca ggctcataag acgccccagc gtcgccatag tgcgttcacc gaatacgtgc 1620 

gcaacaaccg tcttccggag actgtcatac gcgtaaaaca gccagcgctg gcgcgattta 1680 

gccccgacat agccccactg ttcgtccatt tccgcgcaga cgatgacgtc actgcccggc 1740 

tgtatgcgcg aggttaccga ctgcggcctg agttttttaa gtgacgtaaa atcgtgttga 1800 

ggccaacgcc cataatgcgg gctgttgccc ggcatccaac gccattcatg gccatatcaa 1860 

tgattttctg gtgcgtaccg ggttgagaag cggtgtaagt gaactgcagt tgccatgttt 1920 

tacggcagtg agagcagaga tagcgctgat gtccggcggt gcttttgccg ttacgcacca 1980 

ccccgtcagt agctgaacag gagggacagc tgatagacac agaagccact ggagcacctc 2040 

aaaaacacca tcatacacta aatcagtaag ttggcagcat cacccataat tgtggtttca 2100 

aaatcggctc cgtcgatact atgttatacg ccaactttga aaacaacttt gaaaaagctg 2160 

ttttctggta tttaaggttt tagaatgcaa ggaacagtga attggagttc gtcttgttat 2220 

aattagcttc ttggggtatc tttaaatact gtagaaaaga ggaaggaaat aataaatggc 2280 

taaaatgaga atatcaccgg aattgaaaaa actgatcgaa aaataccgct gcgtaaaaga 2340 

tacggaagga atgtctcctg ctaaggtata taagctggtg ggagaaaatg aaaacctata 2400 

tttaaaaatg acggacagcc ggtataaagg gaccacctat gatgtggaac gggaaaagga 24 60 
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catgatgcta tggctggaag gaaagctgcc tgttccaaag gtcctgcact ttgaacggca 2520 

tgatggctgg agcaatctgc tcatgagtga ggccgatggc gtcctttgct cggaagagta 2580 

tgaagatgaa caaagccctg aaaagattat cgagctgtat gcggagtgca tcaggctctt 2640 

tcactccatc gacatatcgg attgtcccta tacgaatagc ttagacagcc gcttagccga 2700 

attggattac ttactgaata acgatctggc cgatgtggat tgcgaaaact gggaagaaga 2760 

cactccattt aaagatccgc gcgagctgta tgatttttta aagacggaaa agcccgaaga 2820 

ggaacttgtc ttttcccacg gcgacctggg agacagcaac atctttgtga aagatggcaa 2880 

agtaagtggc tttattgatc ttgggagaag cggcagggcg gacaagtggt atgacattgc 2940 

cttctgcgtc cggtcgatca gggaggatat cggggaagaa cagtatgtcg agctattttt 3000 

tgacttactg gggatcaagc ctgattggga gaaaataaaa tattatattt tactggatga 3060 

attgttttag tacctagatg tggcgcaacg atgccggcga caagcaggag cgcaccgact 3120 

tcttccgcat caagtgtttt ggctctcagg ccgaggccca cggcaagtat ttgggcaagg 3180 

ggtcgctggt attcgtgcag ggcaagattc ggaataccaa gtacgagaag gacggccaga 3240 

cggtctacgg gaccgacttc attgccgata aggtggatta tctggacacc aaggcaccag 3300 

gcgggtcaaa tcaggaataa gggcacattg ccccggcgtg agtcggggca atcccgcaag 3360 

gagggtgaat gaatcggacg tttgaccgga aggcatacag gcaagaactg atcgacgcgg 3420 

ggttttccgc cgaggatgcc gaaaccatcg caagccgcac cgtcatgcgt gcgccccgcg 3480 

aaaccttcca gtccgtcggc tcgatggtcc agcaagctac ggccaagatc gagcgcgaca 3540 

gcgtgcaact ggctccccct gccctgcccg cgccatcggc cgccgtggag cgttcgcgtc 3600 

gtctcgaaca ggaggcggca ggtttggcga agtcgatgac catcgacacg cgaggaacta 3660 

tgacgaccaa gaagcgaaaa accgccggcg aggacctggc aaaacaggtc agcgaggcca 3720 
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agcaggccgc gttgctgaaa cacacgaagc agcagatcaa ggaaatgcag ctttccttgt 3780 

tcgatattgc gccgtggccg gacacgatgc gagcgatgcc aaacgacacg gcccgctctg 3840 

ccctgttcac cacgcgcaac aagaaaatcc cgcgcgaggc gctgcaaaac aaggtcattt 3900 

tccacgtcaa caaggacgtg aagatcacct acaccggcgt cgagctgcgg gccgacgatg 3960 

acgaactggt gtggcagcag gtgttggagt acgcgaagcg cacccctatc ggcgagccga 4020 

tcaccttcac gttctacgag ctttgccagg acctgggctg gtcgatcaat ggccggtatt 4080 

acacgaaggc cgaggaatgc ctgtcgcgcc tacaggcgac ggcgatgggc ttcacgtccg 4140 

accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc ctggaccgtg 4200 

gcaagaaaac gtcccgttgc caggtcctga tcgacgagga aatcgtcgtg ctgtttgctg 4260 

gcgaccacta cacgaaattc atatgggaga agtaccgcaa gctgtcgccg acggcccgac 4320 

ggatgttcga ctatttcagc tcgcaccggg agccgtaccc gctcaagctg gaaaccttcc 4380 

gcctcatgtg cggatcggat tccacccgcg tgaagaagtg gcgcgagcag gtcggcgaag 4440 

cctgcgaaga gttgcgaggc agcggcctgg tggaacacgc ctgggtcaat gatgacctgg 4500 

tgcattgcaa acgctagggc cttgtggggt cagttccggc tgggggttca gcagccagcg 4560 

ctttactggc atttcaggaa caagcgggca ctgctcgacg cacttgcttc gctcagtatc 4620 

gctcgggacg cacggcgcgc tctacgaact gccgataaac agaggattaa aattgacaat 4 680 

tgtgattaag gctcagattc gacggcttgg agcggccgac gtgcaggatt tccgcgagat 4740 

ccgattgtcg gccctgaaga aagctccaga gatgttcggg tccgtttacg agcacgagga 4800 

gaaaaagccc atggaggcgt tcgctgaacg gttgcgagat gccgtggcat tcggcgccta 4860 

catcgacggc gagatcattg ggctgtcggt cttcaaacag gaggacggcc ccaaggacgc 4 920 
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tcacaaggcg catctgtccg gcgttttcgt ggagcccgaa cagcgaggcc gaggggtcgc 4 980 

cggtatgctg ctgcgggcgt tgccggcggg tttattgctc gtgatgatcg tccgacagat 5040 

tccaacggga atctggtgga tgcgcatctt catcctcggc gcacttaata tttcgctatt 5100 

ctggagcttg ttgtttattt cggtctaccg cctgccgggc ggggtcgcgg cgacggtagg 5160 

cgctgtgcag ccgctgatgg tcgtgttcat ctctgccgct ctgctaggta gcccgatacg 5220 

attgatggcg gtcctggggg ctatttgcgg aactgcgggc gtggcgctgt tggtgttgac 5280 

accaaacgca gcgctagatc ctgtcggcgt cgcagcgggc ctggcggggg cggtttccat 5340 

ggcgttcgga accgtgctga cccgcaagtg gcaacctccc gtgcctctgc tcacctttac 5400 

cgcctggcaa ctggcggccg gaggacttct gctcgttcca gtagctttag tgtttgatcc 5460 

gccaatcccg atgcctacag gaaccaatgt tctcggcctg gcgtggctcg gcctgatcgg 5520 

agcgggttta acctacttcc tttggttccg ggggatctcg cgactcgaac ctacagttgt 5580 

ttccttactg ggctttctca gccccagatc tggggtcgat cagccgggga tgcatcaggc 5640 

cgacagtcgg aacttcgggt ccccgacctg taccattcgg tgagcaatgg ataggggagt 5700 

tgatatcgtc aacgttcact tctaaagaaa tagcgccact cagcttcctc agcggcttta 5760 

tccagcgatt tcctattatg tcggcatagt tctcaagatc gacagcctgt cacggttaag 5820 

cgagaaatga ataagaaggc tgataattcg gatctctgcg agggagatga tatttgatca 5880 

caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga gatcatccgt 5940 

gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac atgagcaaag 6000 

tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg ctgcctgtat 6060 

cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct ggtggcagga 6120 

tatattgtgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg cggacgtttt 6180 
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taatgtactg gggtggtttt tcttttcacc agtgagacgg gcaacagctg attgcccttc 6240 

accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtttgccc cagcaggcga 6300 

aaatcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca aaagaatagc 6360 

ccgagatagg gttgagtgtt gttccagttt ggaacaagag tccactatta aagaacgtgg 6420 

actccaacgt caaagggcga aaaaccgtct atcagggcga tggcccacta cgtgaaccat 6480 

cacccaaatc aagttttttg gggtcgaggt gccgtaaagc actaaatcgg aaccctaaag 6540 

ggagcccccg atttagagct tgacggggaa agccggcgaa cgtggcgaga aaggaaggga 6600 

agaaagcgaa aggagcgggc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg 6660 

cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt 6720 

tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattcgag 6780 

ctcggtaccc ggggatcttt cgacactgaa atacgtcgag cctgctccgc ttggaagcgg 6840 

cgaggagcct cgtcctgtca caactaccaa catggagtac gataagggcc agttccgcca 6900 

gctcattaag agccagttca tgggcgttgg catgatggcc gtcatgcatc tgtacttcaa 6960 

gtacaccaac gctcttctga tccagtcgat catccgctga aggcgctttc gaatctggtt 7020 

aagatccacg tcttcgggaa gccagcgact ggtgacctcc agcgtccctt taaggctgcc 7080 

aacagctttc tcagccaggg ccagcccaag accgacaagg cctccctcca gaacgccgag 7140 

aagaactgga ggggtggtgt caaggaggag taagctcctt attgaagtcg gaggacggag 7200 

cggtgtcaag aggatattct tcgactctgt attatagata agatgatgag gaattggagg 7260 

tagcatagct tcatttggat ttgctttcca ggctgagact ctagcttgga gcatagaggg 7320 

tcctttggct ttcaatattc tcaagtatct cgagtttgaa cttattccct gtgaaccttt 7380 
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tattcaccaa tgagcattgg aatgaacatg aatctgagga ctgcaatcgc catgaggttt 7440 

tcgaaataca tccggatgtc gaaggcttgg ggcacctgcg ttggttgaat ttagaacgtg 7500 

gcactattga tcatccgata gctctgcaaa gggcgttgca caatgcaagt caaacgttgc 7560 

tagcagttcc aggtggaatg ttatgatgag cattgtatta aatcaggaga tatagcatga 7620 

tctctagtta gctcaccaca aaagtcagac ggcgtaacca aaagtcacac aacacaagct 7680 

gtaaggattt cggcacggct acggaagacg gagaagccac cttcagtgga ctcgagtacc 7740 

atttaattct atttgtgttt gatcgagacc taatacagcc cctacaacga ccatcaaagt 7800 

cgtatagcta ccagtgagga agtggactca aatcgacttc agcaacatct cctggataaa 7860 

ctttaagcct aaactataca gaataagata ggtggagagc ttataccgag ctcccaaatc 7920 

tgtccagatc atggttgacc ggtgcctgga tcttcctata gaatcatcct tattcgttga 7980 

cctagctgat tctggagtga cccagagggt catgacttga gcctaaaatc cgccgcctcc 8040 

accatttgta gaaaaatgtg acgaactcgt gagctctgta cagtgaccgg tgactctttc 8100 

tggcatgcgg agagacgga-c ggacgcagag agaagggctg agtaataagc cactggccag 8160 

acagctctgg cggctctgag gtgcagtgga tgattattaa tccgggaccg gccgcccctc 8220 

cgccccgaag tggaaaggct ggtgtgcccc tcgttgacca agaatctatt gcatcatcgg 8280 

agaatatgga gcttcatcga atcaccggca gtaagcgaag gagaatgtga agccaggggt 8340 

gtatagccgt cggcgaaata gcatgccatt aacctaggta cagaagtcca attgcttccg 8400 

atctggtaaa agattcacga gatagtacct tctccgaagt aggtagagcg agtacccggc 8460 

gcgtaagctc cctaattggc ccatccggca tctgtagggc gtccaaatat cgtgcctctc 8520 

ctgctttgcc cggtgtatga aaccggaaag gccgctcagg agctggccag cggcgcagac 8580 

cgggaacaca agctggcagt cgacccatcc ggtgctctgc actcgacctg ctgaggtccc 8640 
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tcagtccctg gtaggcagct ttgccccgtc tgtccgcccg gtgtgtcggc ggggttgaca 8700 

aggtcgttgc gtcagtccaa catttgttgc catattttcc tgctctcccc accagctgct 8760 

cttttctttt ctctttcttt tcccatcttc agtatattca tcttcccatc caagaacctt 8820 

tatttcccct aagtaagtac tttgctacat ccatactcca tcctt.cccat cccttattcc 8880 

tttgaacctt tcagttcgag ctttcccact tcatcgcagc ttgactaaca gctaccccgc 8940 

ttgagcagac atcaccatgc ctgaactcac cgcgacgtct gtcgagaagt ttctgatcga 9000 

aaagttcgac agcgtctccg acctgatgca gctctcggag ggcgaagaat ctcgtgcttt 9060 

cagcttcgat gtaggagggc gtggatatgt cctgcgggta aatagctgcg ccgatggttt 9120 

ctacaaagat cgttatgttt atcggcactt tgcatcggcc gcgctcccga ttccggaagt 9180 

gcttgacatt ggggaattca gcgagagcct gacctattgc atctcccgcc gtgcacaggg 9240 

tgtcacgttg caagacctgc ctgaaaccga actgcccgct gttctgcagc cggtcgcgga 9300 

ggccatggat gcgatcgctg cggccgatct tagccagacg agcgggttcg gcccattcgg 9360 

accgcaagga atcggtcaat acactacatg gcgtgatttc atatgcgcga ttgctgatcc 9420 

ccatgtgtat cactggcaaa ctgtgatgga cgacaccgtc agtgcgtccg tcgcgcaggc 9480 

tctcgatgag ctgatgcttt gggccgagga ctgccccgaa gtccggcacc tcgtgcacgc 9540 

ggatttcggc tccaacaatg tcctgacgga caatggccgc ataacagcgg tcattgactg 9600 

gagcgaggcg atgttcgggg attcccaata cgaggtcgcc aacatcttct tctggaggcc 9660 

gtggttggct tgtatggagc agcagacgcg ctacttcgag cggaggcatc cggagcttgc 9720 

aggatcgccg cggctccggg cgtatatgct ccgcattggt cttgaccaac tctatcagag 9780 

cttggttgac ggcaatttcg atgatgcagc ttgggcgcag ggtcgatgcg acgcaatcgt 9840 
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ccgatccgga gccgggactg tcgggcgtac acaaatcgcc cgcagaagcg cggccgtctg 9900 

gaccgatggc tgtgtagaag tactcgccga tagtggaaac cgacgcccca gcactcgtcc 9960 

gagggcaaag gaatagagta gatgccgacc gcgggatcga tccacttaac gttactgaaa 10020 

tcatcaaaca gcttgacgaa tctggatata agatcgttgg tgtcgatgtc agctccggag 10080 

ttgagacaaa tggtgttcag gatctcgata agatacgttc atttgtccaa gcagcaaaga 10140 

gtgccttcta gtgatttaat agctccatgt caacaagaat. aaaacgcgtt ttcgggttta 10200 

cctcttccag atacagctca tctgcaatgc attaatgcat tgactgcaac ctagtaacgc 10260 

cttncaggct ccggcgaaga gaagaatagc ttagcagagc tattttcatt ttcgggagac 10320 

gagatcaagc agatcaacgg tcgtcaagag acctacgaga ctgaggaatc cgctcttggc 10380 

tccacgcgac tatatatttg tctctaattg tactttgaca tgctcctctt ctttactctg 10440 

atagcttgac tatgaaaatt ccgtcaccag cncctgggtt cgcaaagata attgcatgtt 10500 

tcttccttga actctcaagc ctacaggaca cacattcatc gtaggtataa acctcgaaat 10560 

canttcctac taagatggta tacaatagta accatgcatg gttgcctagt gaatgctccg 10620 

taacacccaa tacgccggcc gaaacttttt tacaactctc ctatgagtcg tttacccaga 10680 

atgcacaggt acacttgttt agaggtaatc cttctttcta gctagaagtc ctcgtgtact 10740 

gtgtaagcgc ccactccaca tctccactcg acctgcaggc atgcaagctt gagattaaaa 10800 

tagataagga aaagaaagtg aaaagaaatt cggaagcatg gcacattctt ctttttataa 10860 

atacatgcct gactttcttt ttccatcgat atgatatatg catatgatag atatacaagc 10920 

aatcttcttc aaggagtttg aaattttgtc ctccaggagc aaaaaaaagt ttttttttat 10980 

acatgtttgt acacaagaat agttaccaat ttgctttggt cttacgtgct gcaagtttat 11040 

atcgttttca atttctttgt ctttacattt tctttgtcct ttatctttcc tcatttagtc 11100 
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tttgggagaa ttaggaaaag ggagcggaaa ggtaagaaat gcttgcgtat tttactaatt 11160 

cggcaaacat ccaatttggc aaacagcagc ctgtgcaacg ctctcgagat gacagtatct 11220 

ttgattacac tctaaatctc gatgacccga ccaaaaagag cgaacaaaga aataatcttg 11280 

tgcattcgaa tatgatggaa gattttttcc cccttattct aaatgttgac atagcgtgta 11340 

tgttatataa acaaaaagaa attgtacaaa ctttcttttc ttctcttttt attttatctc 11400 

tatgttgtgg atttggaatg ccctgatcgt tttcgttacc gtgattggca tggaagtgat 11460 

tgctgcactg gcacacaaat acatcatgca cggctggggt tggggatggc atctttcaca 11520 

tcatgaaccg cgtaaaggtg cgtttgaagt taacgatctt tatgccgtgg tttttgctgc 11580 

attatcgatc ctgctgattt atctgggcag tacaggaatg tggccgctcc agtggattgg 11640 

cgcaggtatg acggcgtatg gattactcta ttttatggtg cacgacgggc tggtgcatca 11700 

acgttggcca ttccgctata ttccacgcaa gggctacctc aaacggttgt atatggcgca 11760 

ccgtatgcat cacgccgtca ggggcaaaga aggttgtgtt tcttttggct tcctctatgc 11820 

gccgcccctg tcaaaacttc aggcgacgct ccgggaaaga catggcgcta gagcgggcgc 11880 

tgccagagat gcgcagggcg gggaggatga gcccgcatcc gggaagtaag ggcctgacca 11940 

gaggcggcca gcagcagcgt taatttttcg ggcgtggtcg ttgactgccg ctgatcccaa 12000 

agcttggcgt aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaatt 12060 

ccacacaaca tacgagccgg aagcataaag tgtaaagcct ggggtgccta atgagtgagc 12120 

taactcacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc 12180 

cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggccaaag 12240 

acaaaagggc gacattcaac cgattgaggg agggaaggta aatattgacg gaaattattc 12300 
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attaaaggtg aattatcacc gtcaccgact tgagccattt gggaattaga gccagcaaaa 12360 

tcaccagtag caccattacc attagcaagg ccggaaacgt caccaatgaa accatcgata 12420 

gcagcaccgt aatcagtagc gacagaatca agtttgcctt tagcgtcaga ctgtagcgcg 12480 

ttttcatcgg cattttcggt catagccccc ttattagcgt ttgccatctt ttcataatca 12540 

aaatcaccgg aaccagagcc accaccggaa ccgcctccct cagagccgcc accctcagaa 12600 

ccgccaccct cagagccacc accctcagag ccgccaccag aaccaccacc agagccgccg 12660 

ccagcattga caggaggccc gatctagtaa catagatgac accgcgcgcg ataatttatc 12720 

ctagtttgcg cgctatattt tgttttctat cgcgtattaa atgtataatt gcgggactct 12780 

aatcataaaa acccatctca taaataacgt catgcattac atgttaatta ttacatgctt 12840 

aacgtaattc aacagaaatt atatgataat catcgcaaga ccggcaacag gattcaatct 12900 

taagaaactt tattgccaaa tgtttgaacg atcggggatc atccgggtct gtggcgggaa 12960 

ctccacgaaa atatccgaac gcagcaagat atcgcggtgc atctcggtct tgcctgggca 13020 

gtcgccgccg acgccgttga tgtggacgcc gggcccgatc atattgtcgc tcaggatcgt 13080 

ggcgttgtgc ttgtcggccg ttgctgtcgt aatgatatcg gcaccttcga ccgcctgttc 13140 

cgcagagatc ccgtgggcga agaactccag catgagatcc ccgcgctgga ggatcatcca 13200 

gccggcgtcc cggaaaacga ttccgaagcc caacctttca tagaaggcgg cggtggaatc 13260 

gaaatctcgt gatggcaggt tgggcgtcgc ttggtcggtc atttcgaacc ccagagtccc 13320 

gctcagaaga actcgtcaag aaggcgatag aaggcgatgc gctgcgaatc gggagcggcg 13380 

ataccgtaaa gcacgaggaa gcggtcagcc cattcgccgc caagctcttc agcaatatca 13440 

cgggtagcca acgctatgtc ctgatagcgg tccgccacac ccagccggcc acagtcgatg 13500 

aatccagaaa agcggccatt ttccaccatg atattcggca agcaggcatc gccatgggtc 13560 
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acgacgagat catcgccgtc gggcatgcgc gccttgagcc tggcgaacag ttcggctggc 13620 

gcgagcccct gatgctcttc gtccagatca tcctgatcga caagaccggc ttccatccga 13680 

gtacgtgctc gctcgatgcg atgtttcgct tggtggtcga atgggcaggt agccggatca 13740 

agcgtatgca gccgccgcat tgcatcagcc atgatggata ctttctcggc aggagcaagg 13800 

tgagatgaca ggagatcctg ccccggcact tcgcccaata gcagccagtc ccttcccgct 13860 

tcagtgacaa cgtcgagcac agctgcgcaa ggaacgcccg tcgtggccag ccacgatagc 13920 

cgcgctgcct cgtcctgcag ttcattcagg gcaccggaca ggtcggtctt gacaaaaaga 13980 

accgggcgcc cctgcgctga cagccggaac acggcggcat cagagcagcc gattgtctgt 14040 

tgtgcccagt catagccgaa tagcctctcc acccaagcgg ccggagaacc tgcgtgcaat 14100 

ccatcttgtt caatcatgcg aaacgatcca gatccggtgc agattatttg gattgagagt 14160 

gaatatgaga ctctaattgg ataccgaggg gaatttatgg aacgtcagtg gagcattttt 14220 

gacaagaaat atttgctagc tgatagtgac cttaggcgac ttttgaacgc gcaataatgg 14280 

tttctgacgt atgtgcttag ctcattaaac tccagaaacc cgcggctgag tggctccttc 14340 

aacgttgcgg ttctgtcagt tccaaacgta aaacggcttg tcccgcgtca tcggcggggg 14400 

tcataacgtg actcccttaa ttctccgctc atgatcagat tgtcgtttcc cgccttcagt 14460 

ttaaactatc agtgtttgac aggatatatt ggcgggtaaa cctaagagaa aagagcgttt 14520 

attagaataa tcggatattt aaaagggcgt gaaaaggttt atccgttcgt ccatttgtat 14580 

gtgcatgcca accacagggt tccccagatc tggcgccggc cagcgagacg agcaagattg 14 640 

gccgccgccc gaaacgatcc gacagcgcgc ccagcacagg tgcgcaggca aattgcacca 14700 

acgcatacag cgccagcaga atgccatagt gggcggtgac gtcgttcgag tgaaccagat 14760 
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cgcgcaggag gcccggcagc accggcataa tcaggccgat gccgacagcg tcgagcgcga 14820 

cagtgctcag aattacgatc aggggtatgt tgggtttcac gtctggcctc cggaccagcc 14 880 

tccgctggtc cgattgaacg cgcggattct ttatcactga taagttggtg gacatattat 14940 

gtttatcagt gataaagtgt caagcatgac aaagttgcag ccgaatacag tgatccgtgc 15000 

cgccctggac ctgttgaacg aggtcggcgt agacggtctg acgacacgca . aactggcgga 15060 

acggttgggg gttcagcagc cggcgcttta ctggcacttc aggaacaagc gggcgctgct 15120 

cgacgcactg gccgaagcca tgctggcgga gaatcatacg cattcggtgc cgagagccga 15180 

cgacgactgg cgctcatttc tgatcgggaa tgcccgcagc ttcaggcagg cgctgctcgc 15240 

ctaccgcgat ggcgcgcgca tccatgccgg cacgcgaccg ggcgcaccgc agatggaaac 15300 

ggccgacgcg cagcttcgct tcctctgcga ggcgggtttt tcggccgggg acgccgtcaa 15360 

tgcgctgatg acaatcagct acttcactgt tggggccgtg cttgaggagc aggccggcga 15420 

cagcgatgcc ggcgagcgcg gcggcaccgt tgaacaggct ccgctctcgc cgctgttgcg 15480 

ggccgcgata gacgccttcg acgaagccgg tccggacgca gcgttcgagc agggactcgc 15540 

ggtgattgtc gatggattgg cgaaaaggag gctcgttgtc aggaacgttg aaggaccgag 15600 

aaagggtgac gattgatcag gaccgctgcc ggagcgcaac ccactcacta cagcagagcc 15660 

atgtagacaa catcccctcc ccctttccac cgcgtcagac gcccgtagca gcccgctacg 15720 

ggctttttca tgccctgccc tagcgtccaa gcctcacggc cgcgctcggc ctctctggcg 15780 

gccttctggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg 15840 

cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat 15900 

aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc 15960 

gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc 16020 
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tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga 16080 

agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt 16140 

ctcccttcgg gaagcgtggc gcttttccgc tgcataaccc tgcttcgggg tcattatagc 16200 

gattttttcg gtatatccat cctttttcgc acgatataca ggattttgcc aaagggttcg 16260 

tgtagacttt ccttggtgta tccaacggcg tcagccgggc aggataggtg aagtaggccc 16320 

acccgcgagc gggtgttcct tcttcactgt cccttattcg cacctggcgg tgctcaacgg 16380 

gaatcctgct ctgcgaggct ggccggctac cgccggcgta acagatgagg gcaagcggat 16440 

ggctgatgaa accaagccaa ccaggaaggg cagcccacct atcaaggtgt actgccttcc 16500 

agacgaacga agagcgattg aggaaaaggc ggcggcggcc ggcatgagcc tgtcggccta 16560 

cctgctggcc gtcggccagg gctacaaaat cacgggcgtc gtggactatg agcacgtccg 16620 

cgagctggcc cgcatcaatg gcgacctggg ccgcctgggc ggcctgctga aactctggct 16680 

caccgacgac ccgcgcacgg cgcggttcgg tgatgccacg atcctcgccc tgctggcgaa 16740 

gatcgaagag aagcaggacg agcttggcaa ggtcatgatg ggcgtggtcc gcccgagggc 16800 

agagccatga cttttttagc cgctaaaacg gccggggggt gcgcgtgatt gccaagcacg 16860 

tccccatgcg ctccatcaag aagagcgact tcgcggagct ggtgaagtac atcaccgacg 16920 

agcaaggcaa gaccgagcgc ctttgcgacg ctca 16954 



<210> 44 

<211> 16954 

<212> DNA 

<213> Artificial 

<220> 

<223> Plasmid 



BASF AG 

BASF NAE 877/03 



203/365 



January 08, 2004 



<220> 

<221> misc_feature 

<222> (10264) . . (10264) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (10472) . . (10472) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (10563) . . (10563) 

<223> n is a, c, g, or t 

<400> 44 

ccgggctggt tgccctcgcc gctgggctgg cggccgtcta tggccctgca aacgcgccag 60 

aaacgccgtc gaagccgtgt gcgagacacc gcggccgccg gcgttgtgga tacctcgcgg 120 

aaaacttggc cctcactgac agatgagggg cggacgttga cacttgaggg gccgactcac 180 

ccggcgcggc gttgacagat gaggggcagg ctcgatttcg gccggcgacg tggagctggc 240 

cagcctcgca aatcggcgaa aacgcctgat tttacgcgag tttcccacag atgatgtgga 300 

caagcctggg gataagtgcc ctgcggtatt gacacttgag gggcgcgact actgacagat 360 

gaggggcgcg atccttgaca cttgaggggc agagtgctga cagatgaggg gcgcacctat 420 

tgacatttga ggggctgtcc acaggcagaa aatccagcat ttgcaagggt ttccgcccgt 480 

ttttcggcca ccgctaacct gtcttttaac ctgcttttaa accaatattt ataaaccttg 540 

tttttaacca gggctgcgcc ctgtgcgcgt gaccgcgcac gccgaagggg ggtgcccccc 600 

cttctcgaac cctcccggcc cgctaacgcg ggcctcccat ccccccaggg gctgcgcccc 660 

tcggccgcga acggcctcac cccaaaaatg gcagcgctgg cagtccttgc cattgccggg 720 
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atcggggcag taacgggatg ggcgatcagc ccgagcgcga cgcccggaag cattgacgtg 780 

ccgcaggtgc tggcatcgac attcagcgac caggtgccgg gcagtgaggg cggcggcctg 840 

ggtggcggcc tgcccttcac ttcggccgtc ggggcattca cggacttcat ggcggggccg 900 

gcaattttta ccttgggcat tcttggcata gtggtcgcgg gtgccgtgct cgtgttcggg 960 

ggtgcgataa acccagcgaa ccatttgagg tgataggtaa gattataccg aggtatgaaa 1020 

acgagaattg gacctttaca gaattactct atgaagcgcc atatttaaaa agctaccaag 1080 

acgaagagga tgaagaggat gaggaggcag attgccttga atatattgac aatactgata 1140 

agataatata tcttttatat agaagatatc gccgtatgta aggatttcag ggggcaaggc 1200 

ataggcagcg cgcttatcaa tatatctata gaatgggcaa agcataaaaa cttgcatgga 1260 

ctaatgcttg aaacccagga caataacctt atagcttgta aattctatca taattgggta 1320 

atgactccaa cttattgata gtgttttatg ttcagataat gcccgatgac tttgtcatgc 1380 

agctccaccg attttgagaa cgacagcgac ttccgtccca gccgtgccag gtgctgcctc 1440 

agattcaggt tatgccgctc aattcgctgc gtatatcgct tgctgattac gtgcagcttt 1500 

cccttcaggc gggattcata cagcggccag ccatccgtca tccatatcac cacgtcaaag 1560 

ggtgacagca ggctcataag acgccccagc gtcgccatag tgcgttcacc gaatacgtgc 1620 

gcaacaaccg tcttccggag actgtcatac gcgtaaaaca gccagcgctg gcgcgattta 1680 

gccccgacat agccccactg ttcgtccatt tccgcgcaga cgatgacgtc actgcccggc 1740 

tgtatgcgcg aggttaccga ctgcggcctg agttttttaa gtgacgtaaa atcgtgttga 1800 

ggccaacgcc cataatgcgg gctgttgccc ggcatccaac gccattcatg gccatatcaa 1860 

tgattttctg gtgcgtaccg ggttgagaag cggtgtaagt gaactgcagt tgccatgttt 1920 
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tacggcagtg agagcagaga tagcgctgat gtccggcggt gcttttgccg ttacgcacca 1980 

ccccgtcagt agctgaacag gagggacagc tgatagacac agaagccact ggagcacctc 2040 

aaaaacacca tcatacacta aatcagtaag ttggcagcat cacccataat tgtggtttca 2100 

aaatcggctc cgtcgatact atgttatacg ccaactttga aaacaacttt gaaaaagctg 2160 

ttttctggta tttaaggttt tagaatgcaa ggaacagtga attggagttc gtcttgttat 2220 

aattagcttc ttggggtatc tttaaatact gtagaaaaga ggaaggaaat aataaatggc 2280 

taaaatgaga atatcaccgg aattgaaaaa actgatcgaa aaataccgct gcgtaaaaga 2340 

tacggaagga atgtctcctg ctaaggtata taagctggtg ggagaaaatg aaaacctata 2400 

tttaaaaatg acggacagcc ggtataaagg gaccacctat gatgtggaac gggaaaagga 2460 

catgatgcta tggctggaag gaaagctgcc tgttccaaag gtcctgcact ttgaacggca 2520 

tgatggctgg agcaatctgc tcatgagtga ggccgatggc gtcctttgct cggaagagta 2580 

tgaagatgaa caaagccctg aaaagattat cgagctgtat gcggagtgca tcaggctctt 2640 

tcactccatc gacatatcgg attgtcccta tacgaatagc ttagacagcc gcttagccga 2700 

attggattac ttactgaata acgatctggc cgatgtggat tgcgaaaact gggaagaaga 2760 

cactccattt aaagatccgc gcgagctgta tgatttttta aagacggaaa agcccgaaga 2820 

ggaacttgtc ttttcccacg gcgacctggg agacagcaac atctttgtga aagatggcaa 2880 

agtaagtggc tttattgatc ttgggagaag cggcagggcg gacaagtggt atgacattgc 2940 

cttctgcgtc cggtcgatca gggaggatat cggggaagaa cagtatgtcg agctattttt 3000 

tgacttactg gggatcaagc ctgattggga gaaaataaaa tattatattt tactggatga 3060 

attgttttag tacctagatg tggcgcaacg atgccggcga caagcaggag cgcaccgact 3120 

tcttccgcat caagtgtttt ggctctcagg ccgaggccca cggcaagtat ttgggcaagg 3180 
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ggtcgctggt attcgtgcag ggcaagattc ggaataccaa gtacgagaag gacggccaga 3240 

cggtctacgg gaccgacttc attgccgata aggtggatta tctggacacc aaggcaccag 3300 

gcgggtcaaa tcaggaataa gggcacattg ccccggcgtg agtcggggca atcccgcaag 3360 

gagggtgaat gaatcggacg tttgaccgga aggcatacag gcaagaactg atcgacgcgg 3420 

ggttttccgc cgaggatgcc gaaaccatcg caagccgcac cgtcatgcgt gcgccccgcg 3480 

aaaccttcca gtccgtcggc tcgatggtcc agcaagctac ggccaagatc gagcgcgaca 3540 

gcgtgcaact ggctccccct gccctgcccg cgccatcggc cgccgtggag cgttcgcgtc 3600 

gtctcgaaca ggaggcggca ggtttggcga agtcgatgac catcgacacg cgaggaacta 3660 

tgacgaccaa gaagcgaaaa accgccggcg aggacctggc aaaacaggtc agcgaggcca 3720 

agcaggccgc gttgctgaaa cacacgaagc agcagatcaa ggaaatgcag ctttccttgt 3780 

tcgatattgc gccgtggccg gacacgatgc gagcgatgcc aaacgacacg gcccgctctg 3840 

ccctgttcac cacgcgcaac aagaaaatcc cgcgcgaggc gctgcaaaac aaggtcattt 3900 

tccacgtcaa caaggacgtg aagatcacct acaccggcgt cgagctgcgg gccgacgatg 3960 

acgaactggt gtggcagcag gtgttggagt acgcgaagcg cacccctatc ggcgagccga 4020 

tcaccttcac gttctacgag ctttgccagg acctgggctg gtcgatcaat ggccggtatt 4080 

acacgaaggc cgaggaatgc ctgtcgcgcc tacaggcgac ggcgatgggc ttcacgtccg 4140 

accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc ctggaccgtg 4200 

gcaagaaaac gtcccgttgc caggtcctga tcgacgagga aatcgtcgtg ctgtttgctg 4260 

gcgaccacta cacgaaattc atatgggaga agtaccgcaa gctgtcgccg acggcccgac 4320 

ggatgttcga ctatttcagc tcgcaccggg agccgtaccc gctcaagctg gaaaccttcc 4380 
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gcctcatgtg cggatcggat tccacccgcg tgaagaagtg gcgcgagcag gtcggcgaag 4440 

cctgcgaaga gttgcgaggc agcggcctgg tggaacacgc ctgggtcaat gatgacctgg 4500 

tgcattgcaa acgctagggc cttgtggggt cagttccggc tgggggttca gcagccagcg 4560 

ctttactggc atttcaggaa caagcgggca ctgctcgacg cacttgcttc gctcagtatc 4 620 

gctcgggacg cacggcgcgc tctacgaact gccgataaac agaggattaa aattgacaat 4 680 

tgtgattaag gctcagattc gacggcttgg agcggccgac gtgcaggatt tccgcgagat 4740 

ccgattgtcg gccctgaaga aagctccaga gatgttcggg tccgtttacg agcacgagga 4800 

gaaaaagccc atggaggcgt tcgctgaacg gttgcgagat gccgtggcat tcggcgccta 4860 

catcgacggc gagatcattg ggctgtcggt cttcaaacag gaggacggcc ccaaggacgc 4 920 

tcacaaggcg catctgtccg gcgttttcgt ggagcccgaa cagcgaggcc gaggggtcgc 4 980 

cggtatgctg ctgcgggcgt tgccggcggg tttattgctc gtgatgatcg tccgacagat 5040 

tccaacggga atctggtgga tgcgcatctt catcctcggc gcacttaata tttcgctatt 5100 

ctggagcttg ttgtttattrt cggtctaccg cctgccgggc ggggtcgcgg cgacggtagg 5160 

cgctgtgcag ccgctgatgg tcgtgttcat ctctgccgct ctgctaggta gcccgatacg 5220 

attgatggcg gtcctggggg ctatttgcgg aactgcgggc gtggcgctgt tggtgttgac 5280 

accaaacgca gcgctagatc ctgtcggcgt cgcagcgggc ctggcggggg cggtttccat 5340 

ggcgttcgga accgtgctga cccgcaagtg gcaacctccc gtgcctctgc tcacctttac 5400 

cgcctggcaa ctggcggccg gaggacttct gctcgttcca gtagctttag tgtttgatcc 54 60 

gccaatcccg atgcctacag gaaccaatgt tctcggcctg gcgtggctcg gcctgatcgg 5520 

agcgggttta acctacttcc tttggttccg ggggatctcg cgactcgaac ctacagttgt 5580 

ttccttactg ggctttctca gccccagatc tggggtcgat cagccgggga tgcatcaggc 5640 
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cgacagtcgg aacttcgggt ccccgacctg taccattcgg tgagcaatgg ataggggagt 5700 

tgatatcgtc aacgttcact tctaaagaaa tagcgccact cagcttcctc agcggcttta 5760 

tccagcgatt tcctattatg tcggcatagt tctcaagatc gacagcctgt cacggttaag 5820 

cgagaaatga ataagaaggc tgataattcg gatctctgcg agggagatga tatttgatca 5880 

caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga gatcatccgt 5940 

gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac atgagcaaag 6000 

tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg ctgcctgtat 6060 

cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct ggtggcagga 6120 

tatattgtgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg cggacgtttt 6180 

taatgtactg gggtggtttt tcttttcacc agtgagacgg gcaacagctg attgcccttc 6240 

accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtttgccc cagcaggcga 6300 

aaatcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca aaagaatagc 6360 

ccgagatagg gttgagtgtt gttccagttt ggaacaagag tccactatta aagaacgtgg 6420 

actccaacgt caaagggcga aaaaccgtct atcagggcga tggcccacta cgtgaaccat 6480 

cacccaaatc aagttttttg gggtcgaggt gccgtaaagc actaaatcgg aaccctaaag 6540 

ggagcccccg atttagagct tgacggggaa agccggcgaa cgtggcgaga aaggaaggga 6600 

agaaagcgaa aggagcgggc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg 6660 

cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt 6720 

tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattcgag 6780 

ctcggtaccc ggggatcttt cgacactgaa atacgtcgag cctgctccgc ttggaagcgg 684 0 
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cgaggagcct cgtcctgtca caactaccaa catggagtac gataagggcc agttccgcca 6900 

gctcattaag agccagttca tgggcgttgg catgatggcc gtcatgcatc tgtacttcaa 6960 

gtacaccaac gctcttctga tccagtcgat catccgctga aggcgctttc gaatctggtt 7020 

aagatccacg tcttcgggaa gccagcgact ggtgacctcc agcgtccctt taaggctgcc 7080 

aacagctttc tcagccaggg ccagcccaag accgacaagg cctccctcca gaacgccgag 7140 

aagaactgga ggggtggtgt caaggaggag taagctcctt attgaagtcg gaggacggag 7200 

cggtgtcaag aggatattct tcgactctgt attatagata agatgatgag gaattggagg 7260 

tagcatagct tcatttggat ttgctttcca ggctgagact ctagcttgga gcatagaggg 7320 

tcctttggct ttcaatattc tcaagtatct cgagtttgaa cttattccct gtgaaccttt 7380 

tattcaccaa tgagcattgg aatgaacatg aatctgagga ctgcaatcgc catgaggttt 7440 

tcgaaataca tccggatgtc gaaggcttgg ggcacctgcg ttggttgaat ttagaacgtg 7500 

gcactattga tcatccgata gctctgcaaa gggcgttgca caatgcaagt caaacgttgc 7560 

tagcagttcc aggtggaatg ttatgatgag cattgtatta aatcaggaga tatagcatga 7620 

tctctagtta gctcaccaca aaagtcagac ggcgtaacca aaagtcacac aacacaagct 7680 

gtaaggattt cggcacggct acggaagacg gagaagccac cttcagtgga ctcgagtacc 7740 

atttaattct atttgtgttt gatcgagacc taatacagcc cctacaacga ccatcaaagt 7800 

cgtatagcta ccagtgagga agtggactca aatcgacttc agcaacatct cctggataaa 7860 

ctttaagcct aaactataca gaataagata ggtggagagc ttataccgag ctcccaaatc 7920 

tgtccagatc atggttgacc ggtgcctgga tcttcctata gaatcatcct tattcgttga 7980 

cctagctgat tctggagtga cccagagggt catgacttga gcctaaaatc cgccgcctcc 8040 

accatttgta gaaaaatgtg acgaactcgt gagctctgta cagtgaccgg tgactctttc 8100 
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tggcatgcgg agagacggac ggacgcagag agaagggctg agtaataagc cactggccag 
acagctctgg cggctctgag gtgcagtgga tgattattaa tccgggaccg gccgcccctc 
cgccccgaag tggaaaggct ggtgtgcccc tcgttgacca agaatctatt gcatcatcgg 
agaatatgga gcttcatcga atcaccggca gtaagcgaag gagaatgtga agccaggggt 
gtatagccgt cggcgaaata gcatgccatt aacctaggta cagaagtcca attgcttccg 
atctggtaaa agattcacga gatagtacct tctccgaagt aggtagagcg agtacccggc 
gcgtaagctc cctaattggc ccatccggca tctgtagggc gtccaaatat cgtgcctctc 
ctg.ctttgcc cggtgtatga aaccggaaag gccgctcagg agctggccag cggcgcagac 
cgggaacaca agctggcagt cgacccatcc ggtgctctgc actcgacctg ctgaggtccc 
tcagtccctg gtaggcagct ttgccccgtc tgtccgcccg gtgtgtcggc ggggttgaca 
aggtcgttgc gtcagtccaa catttgttgc catattttcc tgctctcccc accagctgct 
cttttctttt ctctttcttt tcccatcttc agtatattca tcttcccatc caagaacctt 
tatttcccct aagtaagtac tttgctacat ccatactcca tccttcccat cccttattcc 
tttgaacctt tcagttcgag ctttcccact tcatcgcagc ttgactaaca gctaccccgc 
ttgagcagac atcaccatgc ctgaactcac cgcgacgtct gtcgagaagt ttctgatcga 
aaagttcgac agcgtctccg acctgatgca gctctcggag ggcgaagaat ctcgtgcttt 
cagcttcgat gtaggagggc gtggatatgt cctgcgggta aatagctgcg ccgatggttt 
ctacaaagat cgttatgttt atcggcactt tgcatcggcc gcgctcccga ttccggaagt 
gcttgacatt ggggaattca gcgagagcct gacctattgc atctcccgcc gtgcacaggg 
tgtcacgttg caagacctgc ctgaaaccga actgcccgct gttctgcagc cggtcgcgga 
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ggccatggat gcgatcgctg cggccgatct tagccagacg agcgggttcg gcccattcgg 9360 

accgcaagga atcggtcaat acactacatg gcgtgatttc atatgcgcga ttgctgatcc 9420 

ccatgtgtat cactggcaaa ctgtgatgga cgacaccgtc agtgcgtccg tcgcgcaggc 94 80 

tctcgatgag ctgatgcttt gggccgagga ctgccccgaa gtccggcacc tcgtgcacgc 9540 

ggatttcggc tccaacaatg tcctgacgga caatggccgc ataacagcgg tcattgactg 9600 

gagcgaggcg atgttcgggg attcccaata cgaggtcgcc aacatcttct tctggaggcc 9660 

gtggttggct tgtatggagc agcagacgcg ctacttcgag cggaggcatc cggagcttgc 9720 

aggatcgccg cggctccggg cgtatatgct ccgcattggt cttgaccaac tctatcagag 9780 

cttggttgac ggcaatttcg atgatgcagc ttgggcgcag ggtcgatgcg acgcaatcgt 9840 

ccgatccgga gccgggactg tcgggcgtac acaaatcgcc cgcagaagcg cggccgtctg 9900 

gaccgatggc tgtgtagaag tactcgccga tagtggaaac cgacgcccca gcactcgtcc 9960 

gagggcaaag gaatagagta gatgccgacc gcgggatcga tccacttaac gttactgaaa 10020 

tcatcaaaca gcttgacgaa tctggatata agatcgttgg tgtcgatgtc agctccggag 10080 

ttgagacaaa tggtgttcag gatctcgata agatacgttc atttgtccaa gcagcaaaga 10140 

gtgccttcta gtgatttaat agctccatgt caacaagaat aaaacgcgtt ttcgggttta 10200 

cctcttccag atacagctca tctgcaatgc attaatgcat tgactgcaac ctagtaacgc 10260 

cttncaggct ccggcgaaga gaagaatagc ttagcagagc tattttcatt ttcgggagac 10320 

gagatcaagc agatcaacgg tcgtcaagag acctacgaga ctgaggaatc cgctcttggc 10380 

tccacgcgac tatatatttg tctctaattg tactttgaca tgctcctctt ctttactctg 10440 

atagcttgac tatgaaaatt ccgtcaccag cncctgggtt cgcaaagata attgcatgtt 10500 

tcttccttga actctcaagc ctacaggaca cacattcatc gtaggtataa acctcgaaat 10560 
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canttcctac taagatggta tacaatagta accatgcatg gttgcctagt gaatgctccg 10620 

taacacccaa tacgccggcc gaaacttttt tacaactctc ctatgagtcg tttacccaga 10680 

atgcacaggt acacttgttt agaggtaatc cttctttcta gctagaagtc ctcgtgtact 10740 

gtgtaagcgc ccactccaca tctccactcg acctgcaggc atgcaagctt agagataaaa 10800 

taaaaagaga agaaaagaaa gtttgtacaa tttctttttg tttatataac atacacgcta 10860 

tgtcaacatt tagaataagg gggaaaaaat cttccatcat attcgaatgc acaagattat 10920 

ttctttgttc gctctttttg gtcgggtcat cgagatttag agtgtaatca aagatactgt 10980 

catctcgaga gcgttgcaca ggctgctgtt tgccaaattg gatgtttgcc gaattagtaa 11040 

aatacgcaag catttcttac ctttccgctc ccttttccta attctcccaa agactaaatg 11100 

aggaaagata aaggacaaag aaaatgtaaa gacaaagaaa ttgaaaacga tataaacttg 11160 

cagcacgtaa gaccaaagca aattggtaac tattcttgtg tacaaacatg tataaaaaaa 11220 

aacttttttt tgctcctgga ggacaaaatt tcaaactcct tgaagaagat tgcttgtata 11280 

tctatcatat gcatatatca tatcgatgga aaaagaaagt caggcatgta tttataaaaa 11340 

gaagaatgtg ccatgcttcc gaatttcttt tcactttctt ttccttatct attttaatct 11400 

catgttgtgg atttggaatg ccctgatcgt tttcgttacc gtgattggca tggaagtgat 11460 

tgctgcactg gcacacaaat acatcatgca cggctggggt tggggatggc atctttcaca 11520 

tcatgaaccg cgtaaaggtg cgtttgaagt taacgatctt tatgccgtgg tttttgctgc 11580 

attatcgatc ctgctgattt atctgggcag tacaggaatg tggccgctcc agtggattgg 11640 

cgcaggtatg acggcgtatg gattactcta ttttatggtg cacgacgggc tggtgcatca 11700 

acgttggcca ttccgctata ttccacgcaa gggctacctc aaacggttgt atatggcgca 11760 



BASF AG 21 3/365 January 08, 2004 
BASF NAE 877/03 

ccgtatgcat cacgccgtca ggggcaaaga aggttgtgtt tcttttggct tcctctatgc 11820 

gccgcccctg tcaaaacttc aggcgacgct ccgggaaaga catggcgcta gagcgggcgc 11880 

tgccagagat gcgcagggcg gggaggatga gcccgcatcc gggaagtaag ggcctgacca 11940 

gaggcggcca gcagcagcgt taatttttcg ggcgtggtcg ttgactgccg ctgatcccaa 12000 

agcttggcgt aatcatggtc atagctgttt cctgtgtgaa attgttatcc gctcacaatt 12060 

ccacacaaca tacgagccgg aagcataaag tgtaaagcct ggggtgccta atgagtgagc 12120 

taactcacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa cctgtcgtgc 12180 

cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat tgggccaaag 12240 

acaaaagggc gacattcaac cgattgaggg agggaaggta aatattgacg gaaattattc 12300 

attaaaggtg aattatcacc gtcaccgact tgagccattt gggaattaga gccagcaaaa 12360 

tcaccagtag caccattacc attagcaagg ccggaaacgt caccaatgaa accatcgata 12420 

gcagcaccgt aatcagtagc gacagaatca agtttgcctt tagcgtcaga ctgtagcgcg 12480 

ttttcatcgg cattttcggt catagccccc ttattagcgt ttgccatctt ttcataatca 12540 

aaatcaccgg aaccagagcc accaccggaa ccgcctccct cagagccgcc accctcagaa 12600 

ccgccaccct cagagccacc accctcagag ccgccaccag aaccaccacc agagccgccg 12660 

ccagcattga caggaggccc gatctagtaa catagatgac accgcgcgcg ataatttatc 12720 

ctagtttgcg cgctatattt tgttttctat cgcgtattaa atgtataatt gcgggactct 12780 

aatcataaaa acccatctca taaataacgt catgcattac atgttaatta ttacatgctt 12840 

aacgtaattc aacagaaatt atatgataat catcgcaaga ccggcaacag gattcaatct 12900 

taagaaactt tattgccaaa tgtttgaacg atcggggatc atccgggtct gtggcgggaa 12960 

ctccacgaaa atatccgaac gcagcaagat atcgcggtgc atctcggtct tgcctgggca 13020 
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gtcgccgccg acgccgttga tgtggacgcc gggcccgatc atattgtcgc tcaggatcgt 13080 

ggcgttgtgc ttgtcggccg ttgctgtcgt aatgatatcg gcaccttcga ccgcctgttc 13140 

cgcagagatc ccgtgggcga agaactccag catgagatcc ccgcgctgga ggatcatcca 13200 

gccggcgtcc cggaaaacga ttccgaagcc caacctttca tagaaggcgg cggtggaatc 13260 

gaaatctcgt gatggcaggt tgggcgtcgc ttggtcggtc atttcgaacc ccagagtccc 13320 

gctcagaaga actcgtcaag aaggcgatag aaggcgatgc gctgcgaatc gggagcggcg 13380 

ataccgtaaa gcacgaggaa gcggtcagcc cattcgccgc caagctcttc agcaatatca 13440 

cgggtagcca acgctatgtc ctgatagcgg tccgccacac ccagccggcc acagtcgatg 13500 

aatccagaaa agcggccatt ttccaccatg atattcggca agcaggcatc gccatgggtc 13560 

acgacgagat catcgccgtc gggcatgcgc gccttgagcc tggcgaacag ttcggctggc 13620 

gcgagcccct gatgctcttc gtccagatca tcctgatcga caagaccggc ttccatccga 13680 

gtacgtgctc gctcgatgcg atgtttcgct tggtggtcga atgggcaggt agccggatca 13740 

agcgtatgca gccgccgcat tgcatcagcc atgatggata ctttctcggc aggagcaagg 13800 

tgagatgaca ggagatcctg ccccggcact tcgcccaata gcagccagtc ccttcccgct 13860 

tcagtgacaa cgtcgagcac agctgcgcaa ggaacgcccg tcgtggccag ccacgatagc 13920 

cgcgctgcct cgtcctgcag ttcattcagg gcaccggaca ggtcggtctt gacaaaaaga 13980 

accgggcgcc cctgcgctga cagccggaac acggcggcat cagagcagcc gattgtctgt 14040 

tgtgcccagt catagccgaa tagcctctcc acccaagcgg ccggagaacc tgcgtgcaat 14100 

ccatcttgtt caatcatgcg aaacgatcca gatccggtgc agattatttg gattgagagt 14160 

gaatatgaga ctctaattgg ataccgaggg gaatttatgg aacgtcagtg gagcattttt 14220 
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gacaagaaat atttgctagc tgatagtgac cttaggcgac ttttgaacgc gcaataatgg 
tttctgacgt atgtgcttag ctcattaaac tccagaaacc cgcggctgag tggctccttc 
aacgttgcgg ttctgtcagt tccaaacgta aaacggcttg tcccgcgtca tcggcggggg 
tcataacgtg actcccttaa ttctccgctc atgatcagat tgtcgtttcc cgccttcagt 
ttaaactatc agtgtttgac aggatatatt ggcgggtaaa cctaagagaa aagagcgttt 
attagaataa tcggatattt aaaagggcgt gaaaaggttt atccgttcgt ccatttgtat 
gtgcatgcca accacagggt tccccagatc tggcgccggc cagcgagacg agcaagattg 
gccgccgccc gaaacgatcc gacagcgcgc ccagcacagg tgcgcaggca aattgcacca 
acgcatacag cgccagcaga atgccatagt gggcggtgac gtcgttcgag tgaaccagat 
cgcgcaggag gcccggcagc accggcataa tcaggccgat gccgacagcg tcgagcgcga 
cagtgctcag aattacgatc aggggtatgt tgggtttcac gtctggcctc cggaccagcc 
tccgctggtc cgattgaacg cgcggattct ttatcactga taagttggtg gacatattat 
gtttatcagt gataaagtgt caagcatgac aaagttgcag ccgaatacag tgatccgtgc 
cgccctggac ctgttgaacg aggtcggcgt agacggtctg acgacacgca aactggcgga 
acggttgggg gttcagcagc cggcgcttta ctggcacttc aggaacaagc gggcgctgct 
cgacgcactg gccgaagcca tgctggcgga gaatcatacg cattcggtgc cgagagccga 
cgacgactgg cgctcatttc tgatcgggaa tgcccgcagc ttcaggcagg cgctgctcgc 
ctaccgcgat ggcgcgcgca tccatgccgg cacgcgaccg ggcgcaccgc agatggaaac 
ggccgacgcg cagcttcgct tcctctgcga ggcgggtttt tcggccgggg acgccgtcaa 
tgcgctgatg acaatcagct acttcactgt tggggccgtg cttgaggagc aggccggcga 
cagcgatgcc ggcgagcgcg gcggcaccgt tgaacaggct ccgctctcgc cgctgttgcg 
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ggccgcgata gacgccttcg acgaagccgg tccggacgca gcgttcgagc agggactcgc 15540 

ggtgattgtc gatggattgg cgaaaaggag gctcgttgtc aggaacgttg aaggaccgag 15600 

aaagggtgac gattgatcag gaccgctgcc ggagcgcaac ccactcacta cagcagagcc 15660 

atgtagacaa catcccctcc ccctttccac cgcgtcagac gcccgtagca gcccgctacg 15720 

ggctttttca tgccctgccc tagcgtccaa gcctcacggc cgcgctcggc ctctctggcg 15780 

gccttctggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg 15840 

cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat 15900 

aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc 15960 

gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc 16020 

tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga 16080 

agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt 16140 

ctcccttcgg gaagcgtggc gcttttccgc tgcataaccc tgcttcgggg tcattatagc 16200 

gattttttcg gtatatccat cctttttcgc acgatataca ggattttgcc aaagggttcg 16260 

tgtagacttt ccttggtgta tccaacggcg tcagccgggc aggataggtg aagtaggccc 16320 

acccgcgagc gggtgttcct tcttcactgt cccttattcg cacctggcgg tgctcaacgg 16380 

gaatcctgct ctgcgaggct ggccggctac cgccggcgta acagatgagg gcaagcggat 16440 

ggctgatgaa accaagccaa ccaggaaggg cagcccacct atcaaggtgt actgccttcc 16500 

agacgaacga agagcgattg aggaaaaggc ggcggcggcc ggcatgagcc tgtcggccta 16560 

cctgctggcc gtcggccagg gctacaaaat cacgggcgtc gtggactatg agcacgtccg 16620 

cgagctggcc cgcatcaatg gcgacctggg ccgcctgggc ggcctgctga aactctggct 16680 
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caccgacgac ccgcgcacgg cgcggttcgg tgatgccacg atcctcgccc tgctggcgaa 16740 

gatcgaagag aagcaggacg agcttggcaa ggtcatgatg ggcgtggtcc gcccgagggc 16800 

agagccatga cttttttagc cgctaaaacg gccggggggt gcgcgtgatt gccaagcacg 16860 

tccccatgcg ctccatcaag aagagcgact tcgcggagct ggtgaagtac atcaccgacg 16920 

agcaaggcaa gaccgagcgc ctttgcgacg ctca 16954 
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agcttggtac cgagctcgga tccactagta acggccgcca gtgtgctgga attcgccctt 60 
gacggccagt gaattcgagc tcggtacccg gggatctttc gacactgaaa tacgtcgagc 120 
ctgctccgct tggaagcggc gaggagcctc gtcctgtcac aactaccaac atggagtacg 180 
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ataagggcca gttccgccag ctcattaaga gccagttcat gggcgttggc atgatggccg 240 

tcatgcatct gtacttcaag tacaccaacg ctcttctgat ccagtcgatc . atccgctgaa 300 

ggcgctttcg aatctggtta agatccacgt cttcgggaag ccagcgactg gtgacctcca 360 

gcgtcccttt aaggctgcca acagctttct cagccagggc cagcccaaga ccgacaaggc 420 

ctccctccag aacgccgaga agaactggag gggtggtgtc aaggaggagt aagctcctta 480 

ttgaagtcgg aggacggagc ggtgtcaaga ggatattctt cgactctgta ttatagataa 540 

gatgatgagg aattggaggt agcatagctt catttggatt tgctttccag gctgagactc 600 

tagcttggag catagagggt cctttggctt tcaatattct caagtatctc gagtttgaac 660 

ttattccctg tgaacctttt attcaccaat gagcattgga atgaacatga atctgaggac 720 

tgcaatcgcc atgaggtttt cgaaatacat ccggatgtcg aaggcttggg gcacctgcgt 780 

tggttgaatt tagaacgtgg cactattgat catccgatag ctctgcaaag ggcgttgcac 840 

aatgcaagtc aaacgttgct agcagttcca ggtggaatgt tatgatgagc attgtattaa 900 

atcaggagat atagcatgat ctctagttag ctcaccacaa aagtcagacg gcgtaaccaa 960 

aagtcacaca acacaagctg taaggatttc ggcacggcta cggaagacgg agaagccacc 1020 

ttcagtggac tcgagtacca tttaattcta tttgtgtttg atcgagacct aatacagccc 1080 

ctacaacgac catcaaagtc gtatagctac cagtgaggaa gtggactcaa atcgacttca 1140 

gcaacatctc ctggataaac tttaagccta aactatacag aataagatag gtggagagct 1200 

tataccgagc tcccaaatct gtccagatca tggttgaccg gtgcctggat cttcctatag 1260 

aatcatcctt attcgttgac ctagctgatt ctggagtgac ccagagggtc atgacttgag 1320 

cctaaaatcc gccgcctcca ccatttgtag aaaaatgtga cgaactcgtg agctctgtac 1380 
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agtgaccggt gactctttct ggcatgcgga gagacggacg gacgcagaga gaagggctga 1440 

gtaataagcc actggccaga cagctctggc ggctctgagg tgcagtggat gattattaat 1500 

ccgggaccgg ccgcccctcc gccccgaagt ggaaaggctg gtgtgcccct cgttgaccaa 1560 

gaatctattg catcatcgga gaatatggag cttcatcgaa tcaccggcag taagcgaagg 1620 

agaatgtgaa gccaggggtg tatagccgtc ggcgaaatag catgccatta acctaggtac 1680 

agaagtccaa ttgcttccga tctggtaaaa gattcacgag atagtacctt ctccgaagta 1740 

ggtagagcga gtacccggcg cgtaagctcc ctaattggcc catccggcat ctgtagggcg 1800 

tccaaatatc gtgcctctcc tgctttgccc ggtgtatgaa accggaaagg ccgctcagga 1860 

gctggccagc ggcgcagacc gggaacacaa gctggcagtc gacccatccg gtgctctgca 1920 

ctcgacctgc tgaggtccct cagtccctgg taggcagctt tgccccgtct gtccgcccgg 1980 

tgtgtcggcg gggttgacaa ggtcgttgcg tcagtccaac atttgttgcc atattttcct 2040 

gctctcccca ccagctgctc ttttcttttc tctttctttt cccatcttca gtatattcat 2100 

cttcccatcc aagaaccttt atttccccta agtaagtact ttgctacatc catactccat 2160 

ccttcccatc ccttattcct ttgaaccttt cagttcgagc tttcccactt catcgcagct 2220 

tgactaacag ctaccccgct tgagcagaca tcaccatgct gtcgaagctg cagtcaatca 2280 

gcgtcaaggc ccgccgcgtt gaactagccc gcgacatcac gcggcccaaa gtctgcctgc 2340 

atgctcagcg gtgctcgtta gttcggctgc gagtggcagc accacagaca gaggaggcgc 2400 

tgggaaccgt gcaggctgcc ggcgcgggcg atgagcacag cgccgatgta gcactccagc 24 60 

agcttgaccg ggctatcgca gagcgtcgtg cccggcgcaa acgggagcag ctgtcatacc 2520 

aggctgccgc cattgcagca tcaattggcg tgtcaggcat tgccatcttc gccacctacc 2580 

tgagatttgc catgcacatg accgtgggcg gcgcagtgcc atggggtgaa gtggctggca 2640 
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ctctcctctt ggtggttggt ggcgcgctcg gcatggagat gtatgcccgc tatgcacaca 2700 

aagccatctg gcatgagtcg cctctgggct ggctgctgca caagagccac cacacacctc 2760 

gcactggacc ctttgaagcc aacgacttgt ttgcaatcat caatggactg cccgccatgc 2820 

tcctgtgtac ctttggcttc tggctgccca acgtcctggg ggcggcctgc tttggagcgg 2880 

ggctgggcat cacgctatac ggcatggcat atatgtttgt acacgatggc ctggtgcaca 2940 

ggcgctttcc caccgggccc atcgctggcc tgccctacat gaagcgcctg acagtggccc 3000 

accagctaca ccacagcggc aagtacggtg gcgcgccctg gggtatgttc ttgggtccac 3060 

aggagctgca gcacattcca ggtgcggcgg aggaggtgga gcgactggtc ctggaactgg 3120 

actggtccaa gcggtagggt gcggaaccag gcacgctggt ttcacacctc atgcctgtga 3180 

taaggtgtgg ctagagcgat gcgtgtgaga cgggtatgtc acggtcgact ggtctgatgg 3240 

ccaatggcat cggccatgtc tggtcatcac gggctggttg cctgggtgaa ggtgatgcac 3300 

atcatcatgt gcggttggag gggctggcac agtgtgggct gaactggagc agttgtccag 3360 

gctggcgttg aatcagtgag ggtttgtgat tggcggttgt gaagcaatga ctccgcccat 3420 

attctatttg tgggagctga gatgatggca tgcttgggat gtgcatggat catggtagtg 3480 

cagcaaacta tattcaccta gggctgttgg taggatcagg tgaggccttg cacattgcat 3540 

gatgtactcg tcatggtgtg ttggtgagag gatggatgtg gatggatgtg tattctcaga 3600 

cgtagacctt gactggaggc ttgatcgaga gagtgggccg tattctttga gaggggaggc 3660 

tcgtgccaga aatggtgagt ggatgactgt gacgctgtac attgcaggca ggtgagatgc 3720 

actgtctcga ttgtaaaata cattcagatg caagcttggc gtaatcatgg tcatagctgt 3780 

ttcctgtgtg aaattgttat ccgctcacaa ttccacacaa catacgagcc ggaagcataa 3840 
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agtgtaaagc ctggggtgcc taatgagtga gctaactcac attaattgcg ttgcgctcac 3900 

tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg 3960 

cggggagagg cggtttgcgt attgggccaa agacaaaagg gcgacattca accgattgag 4020 

ggagggaagg taaatattga cggaaattat tcattaaagg tgaattatca ccgtcaccga 4080 

cttgagccat ttgggaatta gagccagcaa aatcaccagt agcaccatta ccattagcaa 4140 

ggccggaaac gtcaccaatg aaaccatcga tagcagcacc gtaatcagta gcgacagaat 4200 

caagtttgcc tttagcgtca gactgtagcg cgttttcatc ggcattttcg gtcatagccc 4260 

ccttattagc gtttgccatc ttttcataat caaaatcacc ggaaccagag ccaccaccgg 4320 

aaccgcctcc ctcagagccg ccaccctcag aaccgccacc ctcagagcca ccaccctcag 4380 

agccgccacc agaaccacca ccagagccgc cgccagcatt gacaggaggc ccgatctagt 4440 

aacatagatg acaccgcgcg cgataattta tcctagtttg cgcgctatat tttgttttct 4500 

atcgcgtatt aaatgtataa ttgcgggact ctaatcataa aaacccatct cataaataac 4560 

gtcatgcatt acatgttaat tattacatgc ttaacgtaat tcaacagaaa ttatatgata 4620 

atcatcgcaa gaccggcaac aggattcaat cttaagaaac tttattgcca aatgtttgaa 4 680 

cgatcgggga tcatccgggt ctgtggcggg aactccacga aaatatccga acgcagcaag 4740 

atatcgcggt gcatctcggt cttgcctggg cagtcgccgc cgacgccgtt gatgtggacg 4800 

ccgggcccga tcatattgtc gctcaggatc gtggcgttgt gcttgtcggc cgttgctgtc 4860 

gtaatgatat cggcaccttc gaccgcctgt tccgcagaga tcccgtgggc gaagaactcc 4 920 

agcatgagat ccccgcgctg gaggatcatc cagccggcgt cccggaaaac gattccgaag 4 980 

cccaaccttt catagaaggc ggcggtggaa tcgaaatctc gtgatggcag gttgggcgtc 5040 

gcttggtcgg tcatttcgaa ccccagagtc ccgctcagaa gaactcgtca agaaggcgat 5100 
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agaaggcgat gcgctgcgaa tcgggagcgg cgataccgta aagcacgagg aagcggtcag 5160 

cccattcgcc gccaagctct tcagcaatat cacgggtagc caacgctatg tcctgatagc 5220 

ggtccgccac acccagccgg ccacagtcga tgaatccaga aaagcggcca ttttccacca 5280 

tgatattcgg caagcaggca tcgccatggg tcacgacgag atcatcgccg tcgggcatgc 5340 

gcgccttgag cctggcgaac agttcggctg gcgcgagccc ctgatgctct tcgtccagat 54 00 

catcctgatc gacaagaccg gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg 54 60 

cttggtggtc gaatgggcag gtagccggat caagcgtatg cagccgccgc attgcatcag 5520 

ccatgatgga tactttctcg gcaggagcaa ggtgagatga caggagatcc tgccccggca 5580 

cttcgcccaa tagcagccag tcccttcccg cttcagtgac aacgtcgagc acagctgcgc 5640 

aaggaacgcc cgtcgtggcc agccacgata gccgcgctgc ctcgtcctgc agttcattca 5700 

gggcaccgga caggtcggtc ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga 57 60 

acacggcggc atcagagcag ccgattgtct gttgtgccca gtcatagccg aatagcctct 5820 

ccacccaagc ggccggagaa cctgcgtgca atccatcttg ttcaatcatg cgaaacgatc 5880 

cagatccggt gcagattatt tggattgaga gtgaatatga gactctaatt ggataccgag 5940 

gggaatttat ggaacgtcag tggagcattt ttgacaagaa atatttgcta gctgatagtg 6000 

accttaggcg acttttgaac gcgcaataat ggtttctgac gtatgtgctt agctcattaa 6060 

actccagaaa cccgcggctg agtggctcct tcaacgttgc ggttctgtca gttccaaacg 6120 

taaaacggct tgtcccgcgt catcggcggg ggtcataacg tgactccctt aattctccgc 6180 

tcatgatcag attgtcgttt cccgccttca gtttaaacta tcagtgtttg acaggatata 6240 

ttggcgggta aacctaagag aaaagagcgt ttattagaat aatcggatat ttaaaagggc 6300 
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gtgaaaaggt ttatccgttc gtccatttgt atgtgcatgc caaccacagg gttccccaga 6360 

tctggcgccg gccagcgaga cgagcaagat tggccgccgc ccgaaacgat ccgacagcgc 64 20 

gcccagcaca ggtgcgcagg caaattgcac caacgcatac agcgccagca gaatgccata 6480 

gtgggcggtg acgtcgttcg agtgaaccag atcgcgcagg aggcccggca gcaccggcat 6540 

aatcaggccg atgccgacag cgtcgagcgc gacagtgctc agaattacga tcaggggtat 6600 

gttgggtttc acgtctggcc tccggaccag cctccgctgg tccgattgaa cgcgcggatt 6660 

ctttatcact gataagttgg tggacatatt atgtttatca gtgataaagt gtcaagcatg 6720 

acaaagttgc agccgaatac agtgatccgt gccgccctgg acctgttgaa cgaggtcggc 6780 

gtagacggtc tgacgacacg caaactggcg gaacggttgg gggttcagca gccggcgctt 6840 

tactggcact tcaggaacaa gcgggcgctg ctcgacgcac tggccgaagc catgctggcg 6900 

gagaatcata cgcattcggt gccgagagcc gacgacgact ggcgctcatt tctgatcggg 6960 

aatgcccgca gcttcaggca ggcgctgctc gcctaccgcg atggcgcgcg catccatgcc 7020 

ggcacgcgac cgggcgcacc gcagatggaa acggccgacg cgcagcttcg cttcctctgc 7080 

gaggcgggtt tttcggccgg ggacgccgtc aatgcgctga tgacaatcag ctacttcact 7140 

gttggggccg tgcttgagga gcaggccggc gacagcgatg ccggcgagcg cggcggcacc 7200 

gttgaacagg ctccgctctc gccgctgttg cgggccgcga tagacgcctt cgacgaagcc 7260 

ggtccggacg cagcgttcga gcagggactc gcggtgattg tcgatggatt ggcgaaaagg 7320 

aggctcgttg tcaggaacgt tgaaggaccg agaaagggtg acgattgatc aggaccgctg 7380 

ccggagcgca acccactcac tacagcagag ccatgtagac aacatcccct ccccctttcc 7440 

accgcgtcag acgcccgtag cagcccgcta cgggcttttt catgccctgc cctagcgtcc 7500 

aagcctcacg gccgcgctcg gcctctctgg cggccttctg gcgctcttcc gcttcctcgc 7560 
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tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg 7 620 

cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag 7680 

gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc 7740 

gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag 7800 

gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga 7860 

ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgcttttcc 7920 

gctgcataac cctgcttcgg ggtcattata gcgatttttt cggtatatcc at cctt'tttc 7980 

gcacgatata caggattttg ccaaagggtt cgtgtagact ttccttggtg tatccaacgg 8040 

cgtcagccgg gcaggatagg' tgaagtaggc ccacccgcga gcgggtgttc cttcttcact 8100 

gtcccttatt cgcacctggc ggtgctcaac gggaatcctg ctctgcgagg ctggccggct 8160 

accgccggcg taacagatga gggcaagcgg atggctgatg aaaccaagcc aaccaggaag 8220 

ggcagcccac ctatcaaggt gtactgcctt ccagacgaac gaagagcgat tgaggaaaag 8280 

gcggcggcgg ccggcatgag cctgtcggcc tacctgctgg ccgtcggcca gggctacaaa 8340 

atcacgggcg tcgtggacta tgagcacgtc cgcgagctgg cccgcatcaa tggcgacctg 84 00 

ggccgcctgg gcggcctgct gaaactctgg ctcaccgacg acccgcgcac ggcgcggttc 8460 

ggtgatgcca cgatcctcgc cctgctggcg aagatcgaag agaagcagga cgagcttggc 8520 

aaggtcatga tgggcgtggt ccgcccgagg gcagagccat gactttttta gccgctaaaa 8580 

cggccggggg gtgcgcgtga ttgccaagca cgtccccatg cgctccatca agaagagcga 8640 

cttcgcggag ctggtgaagt acatcaccga cgagcaaggc aagaccgagc gcctttgcga 8700 

cgctcaccgg gctggttgcc ctcgccgctg ggctggcggc cgtctatggc cctgcaaacg 8760 
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cgccagaaac gccgtcgaag ccgtgtgcga gacaccgcgg ccgccggcgt tgtggatacc 8820 

tcgcggaaaa cttggccctc actgacagat gaggggcgga cgttgacact tgaggggccg 8880 

actcacccgg cgcggcgttg acagatgagg ggcaggctcg atttcggccg gcgacgtgga 8940 

gctggccagc ctcgcaaatc ggcgaaaacg cctgatttta cgcgagtttc ccacagatga 9000 

tgtggacaag cctggggata agtgccctgc ggtattgaca cttgaggggc gcgactactg 9060 

acagatgagg ggcgcgatcc ttgacacttg aggggcagag tgctgacaga tgaggggcgc 9120 

acctattgac atttgagggg ctgtccacag gcagaaaatc cagcatttgc aagggtttcc 9180 

gcccgttttt cggccaccgc taacctgtct tttaacctgc ttttaaacca atatttataa 9240 

accttgtttt taaccagggc tgcgccctgt gcgcgtgacc gcgcacgccg aaggggggtg 9300 

cccccccttc tcgaaccctc ccggcccgct aacgcgggcc tcccatcccc ccaggggctg 9360 

cgcccctcgg ccgcgaacgg cctcacccca aaaatggcag cgctggcagt ccttgccatt 9420 

gccgggatcg gggcagtaac gggatgggcg atcagcccga gcgcgacgcc cggaagcatt 9480 

gacgtgccgc aggtgctggc atcgacattc agcgaccagg tgccgggcag tgagggcggc 9540 

ggcctgggtg gcggcctgcc cttcacttcg gccgtcgggg cattcacgga cttcatggcg 9600 

gggccggcaa tttttacctt gggcattctt ggcatagtgg tcgcgggtgc cgtgctcgtg 9660 

ttcgggggtg cgataaaccc agcgaaccat ttgaggtgat aggtaagatt ataccgaggt 9720 

atgaaaacga gaattggacc tttacagaat tactctatga agcgccatat ttaaaaagct 9780 

accaagacga agaggatgaa gaggatgagg aggcagattg ccttgaatat attgacaata 9840 

ctgataagat aatatatctt ttatatagaa gatatcgccg tatgtaagga tttcaggggg 9900 

caaggcatag gcagcgcgct tatcaatata tctatagaat gggcaaagca taaaaacttg 9960 

catggactaa tgcttgaaac ccaggacaat aaccttatag cttgtaaatt ctatcataat 10020 
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tgggtaatga ctccaactta ttgatagtgt tttatgttca gataatgccc gatgactttg 10080 

tcatgcagct ccaccgattt tgagaacgac agcgacttcc gtcccagccg tgccaggtgc 10140 

tgcctcagat tcaggttatg ccgctcaatt cgctgcgtat atcgcttgct gattacgtgc 10200 

agctttccct tcaggcggga ttcatacagc ggccagccat ccgtcatcca tatcaccacg 10260 

tcaaagggtg acagcaggct cataagacgc cccagcgtcg ccatagtgcg ttcaccgaat 10320 

acgtgcgcaa caaccgtctt ccggagactg tcatacgcgt aaaacagcca gcgctggcgc 10380 

gatttagccc cgacatagcc ccactgttcg tccatttccg cgcagacgat gacgtcactg 10440 

cccggctgta tgcgcgaggt taccgactgc ggcctgagtt ttttaagtga cgtaaaatcg 10500 

tgttgaggcc aacgcccata atgcgggctg ttgcccggca tccaacgcca ttcatggcca 10560 

tatcaatgat tttctggtgc gtaccgggtt gagaagcggt gtaagtgaac tgcagttgcc 10620 

atgttttacg gcagtgagag cagagatagc gctgatgtcc ggcggtgctt ttgccgttac 10680 

gcaccacccc gtcagtagct gaacaggagg gacagctgat agacacagaa gccactggag 10740 

cacctcaaaa acaccatcat acactaaatc agtaagttgg cagcatcacc cataattgtg 10800 

gtttcaaaat cggctccgtc gatactatgt tatacgccaa ctttgaaaac aactttgaaa 10860 

aagctgtttt ctggtattta aggttttaga atgcaaggaa cagtgaattg gagttcgtct 10920 

tgttataatt agcttcttgg ggtatcttta aatactgtag aaaagaggaa ggaaataata 10980 

aatggctaaa atgagaatat caccggaatt gaaaaaactg atcgaaaaat accgctgcgt 11040 

aaaagatacg gaaggaatgt ctcctgctaa ggtatataag ctggtgggag aaaatgaaaa 11100 

cctatattta aaaatgacgg acagccggta taaagggacc acctatgatg tggaacggga 11160 

aaaggacatg atgctatggc tggaaggaaa gctgcctgtt ccaaaggtcc tgcactttga 11220 
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acggcatgat ggctggagca atctgctcat gagtgaggcc gatggcgtcc tttgctcgga 11280 

agagtatgaa gatgaacaaa gccctgaaaa gattatcgag ctgtatgcgg agtgcatcag 11340 

gctctttcac tccatcgaca tatcggattg tccctatacg aatagcttag acagccgctt 11400 

agccgaattg gattacttac tgaataacga tctggccgat gtggattgcg aaaactggga 11460 

agaagacact ccatttaaag atccgcgcga gctgtatgat tttttaaaga cggaaaagcc 11520 

cgaagaggaa cttgtctttt cccacggcga cctgggagac agcaacatct ttgtgaaaga 11580 

tggcaaagta agtggcttta ttgatcttgg gagaagcggc agggcggaca agtggtatga 11640 

cattgccttc tgcgtccggt cgatcaggga ggatatcggg gaagaacagt atgtcgagct 11700 

attttttgac ttactgggga tcaagcctga ttgggagaaa ataaaatatt atattttact 11760 

ggatgaattg ttttagtacc tagatgtggc gcaacgatgc cggcgacaag caggagcgca 11820 

ccgacttctt ccgcatcaag tgttttggct ctcaggccga ggcccacggc aagtatttgg 11880 

gcaaggggtc gctggtattc gtgcagggca agattcggaa taccaagtac gagaaggacg 11940 

gccagacggt ctacgggacc gacttcattg ccgataaggt ggattatctg gacaccaagg 12000 

caccaggcgg gtcaaatcag gaataagggc acattgcccc ggcgtgagtc ggggcaatcc 12060 

cgcaaggagg gtgaatgaat cggacgtttg accggaaggc atacaggcaa gaactgatcg 12120 

acgcggggtt ttccgccgag gatgccgaaa ccatcgcaag ccgcaccgtc atgcgtgcgc 12180 

cccgcgaaac cttccagtcc gtcggctcga tggtccagca agctacggcc aagatcgagc 12240 

gcgacagcgt gcaactggct ccccctgccc tgcccgcgcc atcggccgcc gtggagcgtt 12300 

cgcgtcgtct cgaacaggag gcggcaggtt tggcgaagtc gatgaccatc gacacgcgag 12360 

gaactatgac gaccaagaag cgaaaaaccg ccggcgagga cctggcaaaa caggtcagcg 12420 

aggccaagca ggccgcgttg ctgaaacaca cgaagcagca gatcaaggaa atgcagcttt 12480 
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ccttgttcga tattgcgccg tggccggaca cgatgcgagc gatgccaaac gacacggccc 12540 

gctctgccct gttcaccacg cgcaacaaga aaatcccgcg cgaggcgctg caaaacaagg 12600 

tcattttcca cgtcaacaag gacgtgaaga tcacctacac cggcgtcgag ctgcgggccg 12660 

acgatgacga actggtgtgg cagcaggtgt tggagtacgc gaagcgcacc cctatcggcg 12720 

agccgatcac cttcacgttc tacgagcttt gccaggacct gggctggtcg atcaatggcc 12780 

ggtattacac gaaggccgag gaatgcctgt cgcgcctaca ggcgacggcg atgggcttca 12840 

cgtccgaccg cgttgggcac ctggaatcgg tgtcgctgct gcaccgcttc cgcgtcctgg 12900 

accgtggcaa gaaaacgtcc cgttgccagg tcctgatcga cgaggaaatc gtcgtgctgt 12960 

ttgctggcga ccactacacg aaattcatat gggagaagta ccgcaagctg tcgccgacgg 13020 

cccgacggat gttcgactat ttcagctcgc accgggagcc gtacccgctc aagctggaaa 13080 

ccttccgcct catgtgcgga tcggattcca cccgcgtgaa gaagtggcgc gagcaggtcg 1314 0 

gcgaagcctg cgaagagttg cgaggcagcg gcctggtgga acacgcctgg gtcaatgatg 13200 

acctggtgca ttgcaaacgc tagggccttg tggggtcagt tccggctggg ggttcagcag 13260 

ccagcgcttt actggcattt caggaacaag cgggcactgc tcgacgcact tgcttcgctc 13320 

agtatcgctc gggacgcacg gcgcgctcta cgaactgccg ataaacagag gattaaaatt 13380 

gacaattgtg attaaggctc agattcgacg gcttggagcg gccgacgtgc aggatttccg 13440 

cgagatccga ttgtcggccc tgaagaaagc tccagagatg ttcgggtccg tttacgagca 13500 

cgaggagaaa aagcccatgg aggcgttcgc tgaacggttg cgagatgccg tggcattcgg 13560 

cgcctacatc gacggcgaga tcattgggct gtcggtcttc aaacaggagg acggccccaa 13620 

ggacgctcac aaggcgcatc tgtccggcgt tttcgtggag cccgaacagc gaggccgagg 13680 
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ggtcgccggt atgctgctgc gggcgttgcc ggcgggttta ttgctcgtga tgatcgtccg 13740 

acagattcca acgggaatct ggtggatgcg catcttcatc ctcggcgcac ttaatatttc 13800 

gctattctgg agcttgttgt ttatttcggt ctaccgcctg ccgggcgggg tcgcggcgac 13860 

ggtaggcgct gtgcagccgc tgatggtcgt gttcatctct gccgctctgc taggtagccc 13920 

gatacgattg atggcggtcc tgggggctat ttgcggaact gcgggcgtgg cgctgttggt 13980 

gttgacacca aacgcagcgc tagatcctgt cggcgtcgca gcgggcctgg cgggggcggt 14040 

ttccatggcg ttcggaaccg tgctgacccg caagtggcaa cctcccgtgc ctctgctcac 14100 

ctttaccgcc tggcaactgg cggccggagg acttctgctc gttccagtag ctttagtgtt 14160 

tgatccgcca atcccgatgc ctacaggaac caatgttctc ggcctggcgt ggctcggcct 14220 

gatcggagcg ggtttaacct acttcctttg gttccggggg atctcgcgac tcgaacctac 14280 

agttgtttcc ttactgggct ttctcagccc cagatctggg gtcgatcagc cggggatgca 14340 

tcaggccgac agtcggaact tcgggtcccc gacctgtacc attcggtgag caatggatag 14400 

gggagttgat atcgtcaaeg ttcacttcta aagaaatagc gccactcagc ttcctcagcg 14460 

gctttatcca gcgatttcct attatgtcgg catagttctc aagatcgaca gcctgtcacg 14520 

gttaagcgag aaatgaataa gaaggctgat aattcggatc tctgcgaggg agatgatatt 14580 

tgatcacagg cagcaacgct ctgtcatcgt tacaatcaac atgctaccct ccgcgagatc 14 640 

atccgtgttt caaacccggc agcttagttg ccgttcttcc gaatagcatc ggtaacatga 14700 

gcaaagtctg ccgccttaca acggctctcc cgctgacgcc gtcccggact gatgggctgc 14760 

ctgtatcgag tggtgatttt gtgccgagct gccggtcggg gagctgttgg ctggctggtg 14820 

gcaggatata ttgtggtgta aacaaattga cgcttagaca acttaataac acattgcgga 14880 

cgtttttaat gtactggggt ggtttttctt ttcaccagtg agacgggcaa cagctgattg 14940 
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cccttcaccg cctggccctg agagagttgc agcaagcggt ccacgctggt ttgccccagc 15000 

aggcgaaaat cctgtttgat ggtggttccg aaatcggcaa aatcccttat. aaatcaaaag 15060 

aatagcccga gatagggttg agtgttgttc cagtttggaa caagagtcca ctattaaaga 15120 

acgtggactc caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc ccactacgtg 15180 

aaccatcacc caaatcaagt tttttggggt cgaggtgccg taaagcacta aatcggaacc 15240 

ctaaagggag cccccgattt agagcttgac ggggaaagcc ggcgaacgtg gcgagaaagg 15300 

aagggaagaa agcgaaagga gcgggcgcca ttcaggctgc gcaactgttg ggaagggcga 15360 

tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc tgcaaggcga 15420 

ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac ggccagtgaa 15480 

ttcgagctcg gtacccgggg atctttcgac actgaaatac gtcgagcctg ctccgcttgg 15540 

aagcggcgag gagcctcgtc ctgtcacaac taccaacatg gagtacgata agggccagtt 15600 

ccgccagctc attaagagcc agttcatggg cgttggcatg atggccgtca tgcatctgta 15660 

cttcaagtac accaacgctc ttctgatcca gtcgatcatc cgctgaaggc gctttcg.aat 15720 

ctggttaaga tccacgtctt cgggaagcca gcgactggtg acctccagcg tccctttaag 15780 

gctgccaaca gctttctcag ccagggccag cccaagaccg acaaggcctc cctccagaac 15840 

gccgagaaga actggagggg tggtgtcaag gaggagtaag ctccttattg aagtcggagg 15900 

acggagcggt gtcaagagga tattcttcga ctctgtatta tagataagat gatgaggaat 15960 

tggaggtagc atagcttcat ttggatttgc tttccaggct gagactctag cttggagcat 16020 

agagggtcct ttggctttca atattctcaa gtatctcgag tttgaactta ttccctgtga 16080 

accttttatt caccaatgag cattggaatg aacatgaatc tgaggactgc aatcgccatg 16140 
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aggttttcga aatacatccg gatgtcgaag gcttggggca cctgcgttgg ttgaatttag 16200 

aacgtggcac tattgatcat ccgatagctc tgcaaagggc gttgcacaat gcaagtcaaa 16260 

cgttgctagc agttccaggt ggaatgttat gatgagcatt gtattaaatc aggagatata 16320 

gcatgatctc tagttagctc accacaaaag tcagacggcg taaccaaaag tcacacaaca 16380 

caagctgtaa ggatttcggc acggctacgg aagacggaga agccaccttc agtggactcg 16440 

agtaccattt aattctattt gtgtttgatc gagacctaat acagccccta caacgaccat 16500 

caaagtcgta tagctaccag tgaggaagtg gactcaaatc gacttcagca acatctcctg 16560 

gataaacttt aagcctaaac tatacagaat aagataggtg gagagcttat accgagctcc 16620 

caaatctgtc cagatcatgg ttgaccggtg cctggatctt cctatagaat catccttatt 16680 

cgttgaccta gctgattctg gagtgaccca gagggtcatg acttgagcct aaaatccgcc 16740 

gcctccacca tttgtagaaa aatgtgacga actcgtgagc tctgtacagt gaccggtgac 16800 

tctttctggc atgcggagag acggacggac gcagagagaa gggctgagta ataagccact 16860 

ggccagacag ctctggcggc tctgaggtgc agtggatgat tattaatccg ggaccggccg 16920 

cccctccgcc ccgaagtgga aaggctggtg tgcccctcgt tgaccaagaa tctattgcat 16980 

catcggagaa tatggagctt catcgaatca ccggcagtaa gcgaaggaga atgtgaagcc 17040 

aggggtgtat agccgtcggc gaaatagcat gccattaacc taggtacaga agtccaattg 17100 

cttccgatct ggtaaaagat tcacgagata gtaccttctc cgaagtaggt agagcgagta 17160 

cccggcgcgt aagctcccta attggcccat ccggcatctg tagggcgtcc aaatatcgtg 17220 

cctctcctgc tttgcccggt gtatgaaacc ggaaaggccg ctcaggagct ggccagcggc 17280 

gcagaccggg aacacaagct ggcagtcgac ccatccggtg ctctgcactc gacctgctga 17340 

ggtccctcag tccctggtag gcagctttgc cccgtctgtc cgcccggtgt gtcggcgggg 17400 
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ttgacaaggt cgttgcgtca gtccaacatt tgttgccata ttttcctgct ctccccacca 17460 

gctgctcttt tcttttctct ttcttttccc atcttcagta tattcatctt cccatccaag 17520 

aacctttatt tcccctaagt aagtactttg ctacatccat actccatcct tcccatccct 17580 

tattcctttg aacctttcag ttcgagcttt cccacttcat cgcagcttga ctaacagcta 17640 

ccccgcttga gcagacatca ccatgcctga actcaccgcg acgtctgtcg agaagtttct 17700 

gatcgaaaag ttcgacagcg tctccgacct gatgcagctc tcggagggcg aagaatctcg 17760 

tgctttcagc ttcgatgtag gagggcgtgg atatgtcctg cgggtaaata gctgcgccga 17820 

tggtttctac aaagatcgtt atgtttatcg gcactttgca tcggccgcgc tcccgattcc 17880 

ggaagtgctt gacattgggg aattcagcga gagcctgacc tattgcatct cccgccgtgc 17940 

acagggtgtc acgttgcaag acctgcctga aaccgaactg cccgctgttc tgcagccggt 18000 

cgcggaggcc atggatgcga tcgctgcggc cgatcttagc cagacgagcg ggttcggccc 18060 

attcggaccg caaggaatcg gtcaatacac tacatggcgt gatttcatat gcgcgattgc 18120 

tgatccccat gtgtatcact ggcaaactgt gatggacgac accgtcagtg cgtccgtcgc 18180 

gcaggctctc gatgagctga tgctttgggc cgaggactgc cccgaagtcc ggcacctcgt 18240 

gcacgcggat ttcggctcca acaatgtcct gacggacaat ggccgcataa cagcggtcat 18300 

tgactggagc gaggcgatgt tcggggattc ccaatacgag gtcgccaaca tcttcttctg 18360 

gaggccgtgg ttggcttgta tggagcagca gacgcgctac ttcgagcgga ggcatccgga 18420 

gcttgcagga tcgccgcggc tccgggcgta tatgctccgc attggtcttg accaactcta 18480 

tcagagcttg gttgacggca atttcgatga tgcagcttgg gcgcagggtc gatgcgacgc 18540 

aatcgtccga tccggagccg ggactgtcgg gcgtacacaa atcgcccgca gaagcgcggc 18600 
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cgtctggacc gatggctgtg tagaagtact cgccgatagt ggaaaccgac gccccagcac 18660 

tcgtccgagg gcaaaggaat agagtagatg ccgaccgcgg gatcgatcca cttaacgtta 18720 

ctgaaatcat caaacagctt gacgaatctg gatataagat cgttggtgtc gatgtcagct 18780 

ccggagttga gacaaatggt gttcaggatc tcgataagat acgttcattt gtccaagcag 18840 

caaagagtgc cttctagtga tttaatagct ccatgtcaac aagaataaaa cgcgttttcg 18900 

ggtttacctc ttccagatac agctcatctg caatgcatta atgcattgac tgcaacctag 18960 

taacgccttn caggctccgg cgaagagaag aatagcttag cagagctatt ttcattttcg 19020 

ggagacgaga tcaagcagat caacggtcgt caagagacct acgagactga ggaatccgct 19080 

cttggctcca cgcgactata tatttgtctc taattgtact ttgacatgct cctcttcttt 19140 

actctgatag cttgactatg aaaattccgt caccagcncc tgggttcgca aagataattg 19200 

catgtttctt ccttgaactc tcaagcctac aggacacaca ttcatcgtag gtataaacct 19260 

cgaaatcant tcctactaag atggtataca atagtaacca tgcatggttg cctagtgaat 19320 

gctccgtaac acccaatacg ccggccgaaa cttttttaca actctcctat gagtcgttta 19380 

cccagaatgc acaggtacac ttgtttagag gtaatccttc tttctagcta gaagtcctcg 19440 

tgtactgtgt aagcgcccac tccacatctc cactcgacct gcaggcatgc a 19491 



<210> 46 

<211> 21300 

<212> DNA 

<213> Artificial 

<220> 

<223> Plasmid 



<220> 
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<221> misc_f eature 
<222> (3471) . . (3471) 
<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (3679) . . (3679) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (3770) . . (3770) 

<223> n is a, c, g, or t 

<400> 46 

gatctttcga cactgaaata cgtcgagcct gctccgcttg gaagcggcga ggagcctcgt 60 

cctgtcacaa ctaccaacat ggagtacgat aagggccagt tccgccagct cattaagagc 120 

cagttcatgg gcgttggcat gatggccgtc atgcatctgt acttcaagta caccaacgct 180 

cttctgatcc agtcgatcat ccgctgaagg cgctttcgaa tctggttaag atccacgtct 240 

tcgggaagcc agcgactggt gacctccagc gtccctttaa ggctgccaac agctttctca 300 

gccagggcca gcccaagacc gacaaggcct ccctccagaa cgccgagaag aactggaggg 360 

gtggtgtcaa ggaggagtaa gctccttatt gaagtcggag gacggagcgg tgtcaagagg 420 

atattcttcg actctgtatt atagataaga tgatgaggaa ttggaggtag catagcttca 480 

tttggatttg ctttccaggc tgagactcta gcttggagca tagagggtcc tttggctttc 540 

aatattctca agtatctcga gtttgaactt attccctgtg aaccttttat tcaccaatga 600 

gcattggaat gaacatgaat ctgaggactg caatcgccat gaggttttcg aaatacatcc 660 

ggatgtcgaa ggcttggggc acctgcgttg gttgaattta gaacgtggca ctattgatca 720 

tccgatagct ctgcaaaggg cgttgcacaa tgcaagtcaa acgttgctag cagttccagg 780 
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tggaatgtta tgatgagcat tgtattaaat caggagatat agcatgatct ctagttagct 840 

caccacaaaa gtcagacggc gtaaccaaaa gtcacacaac acaagctgta aggatttcgg 900 

cacggctacg gaagacggag aagccacctt cagtggactc gagtaccatt taattctatt 960 

tgtgtttgat cgagacctaa tacagcccct acaacgacca tcaaagtcgt atagctacca 1020 

gtgaggaagt ggactcaaat cgacttcagc aacatctcct ggataaactt taagcctaaa 1080 

ctatacagaa taagataggt ggagagctta taccgagctc ccaaatctgt ccagatcatg 1140 

gttgaccggt gcctggatct tcctatagaa tcatccttat tcgttgacct agctgattct 1200 

ggagtgaccc agagggtcat gacttgagcc taaaatccgc cgcctccacc atttgtagaa 1260 

aaatgtgacg aactcgtgag ctctgtacag tgaccggtga ctctttctgg catgcggaga 1320 

gacggacgga cgcagagaga agggctgagt aataagccac tggccagaca gctctggcgg 1380 

ctctgaggtg cagtggatga ttattaatcc gggaccggcc gcccctccgc cccgaagtgg 1440 

aaaggctggt gtgcccctcg ttgaccaaga atctattgca tcatcggaga atatggagct 1500 

tcatcgaatc accggcagta agcgaaggag aatgtgaagc caggggtgta tagccgtcgg 1560 

cgaaatagca tgccattaac ctaggtacag aagtccaatt gcttccgatc tggtaaaaga 1620 

ttcacgagat agtaccttct ccgaagtagg tagagcgagt acccggcgcg taagctccct 1680 

aattggccca tccggcatct gtagggcgtc caaatatcgt gcctctcctg ctttgcccgg 1740 

tgtatgaaac cggaaaggcc gctcaggagc tggccagcgg cgcagaccgg gaacacaagc 1800 

tggcagtcga cccatccggt gctctgcact cgacctgctg aggtccctca gtccctggta 1860 

ggcagctttg ccccgtctgt ccgcccggtg tgtcggcggg gttgacaagg tcgttgcgtc 1920 

agtccaacat ttgttgccat attttcctgc tctccccacc agctgctctt ttcttttctc 1980 

tttcttttcc catcttcagt atattcatct tcccatccaa gaacctttat ttcccctaag 2040 
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taagtacttt gctacatcca tactccatcc ttcccatccc ttattccttt gaacctttca 2100 

gttcgagctt tcccacttca tcgcagcttg actaacagct accccgcttg agcagacatc 2160 

accatgcctg aactcaccgc gacgtctgtc gagaagtttc tgatcgaaaa gttcgacagc 2220 

gtctccgacc tgatgcagct ctcggagggc gaagaatctc gtgctttcag cttcgatgta 2280 

ggagggcgtg gatatgtcct gcgggtaaat agctgcgccg atggtttcta caaagatcgt 2340 

tatgtttatc ggcactttgc atcggccgcg ctcccgattc cggaagtgct tgacattggg 2400 

gaattcagcg agagcctgac ctattgcatc tcccgccgtg cacagggtgt cacgttgcaa 24 60 

gacctgcctg aaaccgaact gcccgctgtt ctgcagccgg tcgcggaggc catggatgcg 2520 

atcgctgcgg ccgatcttag ccagacgagc gggttcggcc cattcggacc gcaaggaatc 2580 

ggtcaataca ctacatggcg tgatttcata tgcgcgattg ctgatcccca tgtgtatcac 2640 

tggcaaactg tgatggacga caccgtcagt gcgtccgtcg cgcaggctct cgatgagctg 2700 

atgctttggg ccgaggactg ccccgaagtc cggcacctcg tgcacgcgga tttcggctcc 2760 

aacaatgtcc tgacggacaa tggccgcata acagcggtca ttgactggag cgaggcgatg 2820 

ttcggggatt cccaatacga ggtcgccaac atcttcttct ggaggccgtg gttggcttgt 2880 

atggagcagc agacgcgcta cttcgagcgg aggcatccgg agcttgcagg atcgccgcgg 2940 

ctccgggcgt atatgctccg cattggtctt gaccaactct atcagagctt ggttgacggc 3000 

aatttcgatg atgcagcttg ggcgcagggt cgatgcgacg caatcgtccg atccggagcc 3060 

gggactgtcg ggcgtacaca aatcgcccgc agaagcgcgg ccgtctggac cgatggctgt 3120 

gtagaagtac tcgccgatag tggaaaccga cgccccagca ctcgtccgag ggcaaaggaa 3180 

tagagtagat gccgaccgcg ggatcgatcc acttaacgtt actgaaatca tcaaacagct 3240 
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tgacgaatct ggatataaga tcgttggtgt cgatgtcagc tccggagttg agacaaatgg 3300 

tgttcaggat ctcgataaga tacgttcatt tgtccaagca gcaaagagtg ccttctagtg 3360 

atttaatagc tccatgtcaa caagaataaa acgcgttttc gggtttacct cttccagata 3420 

cagctcatct gcaatgcatt aatgcattga ctgcaaccta gtaacgcctt ncaggctccg 3480 

gcgaagagaa gaatagctta gcagagctat tttcattttc gggagacgag atcaagcaga 3540 

tcaacggtcg tcaagagacc tacgagactg aggaatccgc tcttggctcc acgcgactat 3600 

atatttgtct ctaattgtac tttgacatgc tcctcttctt tactctgata gcttgactat 3660 

gaaaattccg tcaccagcnc ctgggttcgc aaagataatt gcatgtttct tccttgaact 3720 

ctcaagccta caggacacac attcatcgta ggtataaacc tcgaaatcan ttcctactaa 3780 

gatggtatac aatagtaacc atgcatggtt gcctagtgaa tgctccgtaa cacccaatac 3840 

gccggccgaa acttttttac aactctccta tgagtcgttt acccagaatg cacaggtaca 3900 

cttgtttaga ggtaatcctt ctttctagct agaagtcctc gtgtactgtg taagcgccca 3960 

ctccacatct ccactcgacc tgcaggcatg caagcttgaa ttcgagctcg gtacccgggg 4020 

atctttcgac actgaaatac gtcgagcctg ctccgcttgg aagcggcgag gagcctcgtc 4080 

ctgtcacaac taccaacatg gagtacgata agggccagtt ccgccagctc attaagagcc 4140 

agttcatggg cgttggcatg atggccgtca tgcatctgta cttcaagtac accaacgctc 4200 

ttctgatcca gtcgatcatc cgctgaaggc gctttcgaat ctggttaaga tccacgtctt 4260 

cgggaagcca gcgactggtg acctccagcg tccctttaag gctgccaaca gctttctcag 4320 

ccagggccag cccaagaccg acaaggcctc cctccagaac gccgagaaga actggagggg 4380 

tggtgtcaag gaggagtaag ctccttattg aagtcggagg acggagcggt gtcaagagga 4440 

tattcttcga ctctgtatta tagataagat gatgaggaat tggaggtagc atagcttcat 4500 
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ttggatttgc tttccaggct gagactctag cttggagcat agagggtcct ttggctttca 4560 

atattctcaa gtatctcgag tttgaactta ttccctgtga accttttatt caccaatgag 4620 

cattggaatg aacatgaatc tgaggactgc aatcgccatg aggttttcga aatacatccg 4680 

gatgtcgaag gcttggggca cctgcgttgg ttgaatttag aacgtggcac tattgatcat 4740 

ccgatagctc tgcaaagggc gttgcacaat gcaagtcaaa cgttgctagc agttccaggt 4800 

ggaatgttat gatgagcatt gtattaaatc aggagatata gcatgatctc tagttagctc 4860 

accacaaaag tcagacggcg taaccaaaag tcacacaaca caagctgtaa ggatttcggc 4920 

acggctacgg aagacggaga agccaccttc agtggactcg agtaccattt aattctattt 4980 

gtgtttgatc gagacctaat acagccccta caacgaccat caaagtcgta tagctaccag 5040 

tgaggaagtg gactcaaatc gacttcagca acatctcctg gataaacttt aagcctaaac 5100 

tatacagaat aagataggtg gagagcttat accgagctcc caaatctgtc cagatcatgg 5160 

ttgaccggtg cctggatctt cctatagaat catccttatt cgttgaccta gctgattctg 5220 

gagtgaccca gagggtcatg acttgagcct aaaatccgcc gcctccacca tttgtagaaa 5280 

aatgtgacga actcgtgagc tctgtacagt gaccggtgac tctttctggc atgcggagag 5340 

acggacggac gcagagagaa gggctgagta ataagccact ggccagacag ctctggcggc 5400 

tctgaggtgc agtggatgat tattaatccg ggaccggccg cccctccgcc ccgaagtgga 54 60 

aaggctggtg tgcccctcgt tgaccaagaa tctattgcat catcggagaa tatggagctt 5520 

catcgaatca ccggcagtaa gcgaaggaga atgtgaagcc aggggtgtat agccgtcggc 5580 

gaaatagcat gccattaacc taggtacaga agtccaattg cttccgatct ggtaaaagat 5640 

tcacgagata gtaccttctc cgaagtaggt agagcgagta cccggcgcgt aagctcccta 5700 
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attggcccat ccggcatctg tagggcgtcc aaatatcgtg cctctcctgc tttgcccggt 5760 

gtatgaaacc ggaaaggccg ctcaggagct ggccagcggc gcagaccggg aacacaagct 5820 

ggcagtcgac ccatccggtg ctctgcactc gacctgctga ggtccctcag tccctggtag 5880 

gcagctttgc cccgtctgtc cgcccggtgt gtcggcgggg ttgacaaggt cgttgcgtca 594 0 

gtccaacatt tgttgccata ttttcctgct ctccccacca gctgctcttt tcttttctct 6000 

ttcttttccc atcttcagta tattcatctt cccatccaag aacctttatt tcccctaagt 6060 

aagtactttg ctacatccat actccatcct tcccatccct tattcctttg aacctttcag 6120 

ttcgagcttt cccacttcat cgcagcttga ctaacagcta ccccgcttga gcagacatca 6180 

ccatgtcaat actcacttat ctggaatttc atctctacta tacactacct gtccttgcgg 6240 

cattgtgttg gctgctaaag ccgtttcact cacagcaaga caatctcaag tataaatttt 6300 

taatgttgat ggccgcctct accgcatcga tttgggacaa ttatatcgtt tatcatcgcg 6360 

cttggtggta ctgtcctact tgtgttgtgg ctgtcattgg ctatgtacct ctagaagaat 6420 

acatgttctt tatcatcatg actttaatga ctgtcgcgtt ctcaaacttt gttatgcgtt 6480 

ggcacttgca tactttcttt attagaccca acacttcttg gaagcaaaca ctattagtac 6540 

gccttgtgcc tgtttcagct ttattggcaa tcacttatca tgcttggcac ttgacactgc 6600 

caaataaacc ttcattttat ggttcatgca tcctttggta tgcttgtcct gtgttggcta 6660 

ttctttggct gggtgctggc gaatatatct tgcgtcgacc tgtggctgtc cttttgtcta 6720 

ttgttatccc tagtgtatac ctatgttggg ctgatatcgt cgctattagt gctggcacat 6780 

ggcatatttc tcttagaaca agcactggca aaatggtagt acccgattta cctgtagaag 6840 

aatgcctgtt ttttactttg atcaacacag tcttggtttt tgctacctgt gctatagacc 6900 

gcgctcaggc catcctccat gtgagcgcgc gtaatacgac tcactatagg gcgaattgga 6960 
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gctccaccgc ggtggcggcc gctctagaac tagtggatcc cccgggctgc aggaattcgg 7020 

cacgagctac atttcacaag cccgtgagcg gtgcaagcgc tctgccccac atcggcccac 7080 

ctcctcatct ccatcggtca tttgctgcta ccacgatgct gtcgaagctg cagtcaatca 7140 

gcgtcaaggc ccgccgcgtt gaactagccc gcgacatcac gcggcccaaa gtctgcctgc 7200 

atgctcagcg gtgctcgtta gttcggctgc gagtggcagc accacagaca gaggaggcgc 7260 

tgggaaccgt gcaggctgcc ggcgcgggcg atgagcacag cgccgatgta gcactccagc 7320 

agcttgaccg ggctatcgca gagcgtcgtg cccggcgcaa acgggagcag ctgtcatacc 7380 

aggctgccgc cattgcagca tcaattggcg tgtcaggcat tgccatcttc gccacctacc 7440 

tgagatttgc catgcacatg accgtgggcg gcgcagtgcc atggggtgaa gtggctggca 7500 

ctctcctctt ggtggttggt ggcgcgctcg gcatggagat gtatgcccgc tatgcacaca 7560 

aagccatctg gcatgagtcg cctctgggct ggctgctgca caagagccac cacacacctc 7620 

gcactggacc ctttgaagcc aacgacttgt ttgcaatcat caatggactg cccgccatgc 7680 

tcctgtgtac ctttggcttc tggctgccca acgtcctggg ggcggcctgc tttggagcgg 7740 

ggctgggcat cacgctatac ggcatggcat atatgtttgt acacgatggc ctggtgcaca 7800 

ggcgctttcc caccgggccc atcgctggcc tgccctacat gaagcgcctg acagtggccc 7860 

accagctaca ccacagcggc aagtacggtg gcgcgccctg gggtatgttc ttgggtccac 7920 

aggagctgca gcacattcca ggtgcggcgg aggaggtgga gcgactggtc ctggaactgg 7980 

actggtccaa gcgggctcag gccatcctcc atctgtacaa atcatctgtt caaaatcaaa 8040 

accctaaaca agccatttcc cttttccagc atgtcaaaga gctagcatgg gccttctgtc 8100 

ttcctgacca aatgctcaac aatgaattgt ttgatgatct tactatcagc tgggatattt 8160 
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tacgtaaagc ctcaaagtca ttctatactg catctgccgt ttttccaagt tatgtacgtc 8220 

aagacttggg tgttctctat gctttctgca gagctaccga tgacctgtgc gatgatgaat 8280 

ccaaatctgt tcaagaaaga agagaccaat tagatcttac tcgacaattt gttcgtgatc 8340 

tctttagcca aaagaccagt gcgcctattg tgattgattg ggaattgtat caaaaccaac 8400 

ttcctgcttc ttgtatatca gcctttagag cctttactcg ccttcgccat gtccttgaag 84 60 

tagaccctgt agaagaacta ttagatggtt acaaatggga tcttgagcgt cgtcctatcc 8520 

ttgatgaaca agacttggag gcatactctg cttgtgtggc cagtagtgtg ggtgaaatgt 8580 

gcacacgtgt gattcttgct caagaccaaa aggaaaatga tgcttggata attgaccgtg 8640 

cacgtgagat ggggctggtg ctacaatacg ttaacattgc tcgagacatt gtgactgata 8700 

gcgagactct gggtcgatgt tatctgcctc aacaatggct tagaaaagaa gaaacagaac 8760 

aaatacagca aggcaacgcc cgtagcctag gtgatcaaag actgttgggc ttgtctctga 8820 

agcttgtagg aaaggcagac gctatcatgg tgagagctaa gaagggcatt gacaagttgc 8880 

cggcaaactg tcaaggcggt: gtacgagctg cttgccaagt atatgctgca attggatctg 8940 

tactcaagca gcagaagaca acatatccta caagagctca tctaaaagga agcgaacgtg 9000 

ccaagattgc tctgttgagt gtatacaacc tctatcaatc tgaagacaag cctgtggctc 9060 

tccgtcaagc tagaaagatt aagagttttt ttgttgatta gtgaattttt gttttattta 9120 

tgtctgatag ttcaataaag agacaacaca tacaatataa aatcattgtc tttaaatgtt 9180 

aatttagtag agtgtaaagc ctgcattttt tttgtacgca taaacaatga gttcaccccg 9240 

cttctggttt ttaaataatt atgtcaaact agggaaaatt cttttttttc tcttcgttct 9300 

ttttttggct tgttgtggag tcacaggctt gtcttcagat tgatagaggt tgtatacact 9360 

caacagagca atcttggcac gttcgcttcc ttttagatga gctcttgtag gatatgttgt 9420 
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cttctgctgc ttgagtacag atccaattgc agcatatact tggcaagcag ctcgtacacc 9480 

gccttgacag tttgccggca acttgtcaat gcccttctta gctctcacca tgatagcgtc 9540 

tgcctttcct acaagcttgg cgtaatcatg gtcatagctg tttcctgtgt gaaattgtta 9600 

tccgctcaca attccacaca acatacgagc cggaagcata aagtgtaaag cctggggtgc 9660 

ctaatgagtg agctaactca cattaattgc gttgcgctca ctgcccgctt tccagtcggg 9720 

aaacctgtcg tgccagctgc attaatgaat cggccaacgc gcggggagag gcggtttgcg 9780 

tattgggcca aagacaaaag ggcgacattc aaccgattga gggagggaag gtaaatattg 9840 

acggaaatta ttcattaaag gtgaattatc accgtcaccg acttgagcca tttgggaatt 9900 

agagccagca aaatcaccag tagcaccatt accattagca aggccggaaa cgtcaccaat 9960 

gaaaccatcg atagcagcac cgtaatcagt agcgacagaa tcaagtttgc ctttagcgtc 10020 

agactgtagc gcgttttcat cggcattttc ggtcatagcc cccttattag cgtttgccat 10080 

cttttcataa tcaaaatcac cggaaccaga gccaccaccg gaaccgcctc cctcagagcc 10140 

gccaccctca gaaccgccac cctcagagcc accaccctca gagccgccac cagaaccacc 10200 

accagagccg ccgccagcat tgacaggagg cccgatctag taacatagat gacaccgcgc 10260 

gcgataattt atcctagttt gcgcgctata ttttgttttc tatcgcgtat taaatgtata 10320 

attgcgggac tctaatcata aaaacccatc tcataaataa cgtcatgcat tacatgttaa 10380 

ttattacatg cttaacgtaa ttcaacagaa attatatgat aatcatcgca agaccggcaa 10440 

caggattcaa tcttaagaaa ctttattgcc aaatgtttga acgatcgggg atcatccggg 10500 

tctgtggcgg gaactccacg aaaatatccg aacgcagcaa gatatcgcgg tgcatctcgg 10560 

tcttgcctgg gcagtcgccg ccgacgccgt tgatgtggac gccgggcccg atcatattgt 10620 
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cgctcaggat cgtggcgttg tgcttgtcgg ccgttgctgt cgtaatgata tcggcacctt 10680 

cgaccgcctg ttccgcagag atcccgtggg cgaagaactc cagcatgaga tccccgcgct 10740 

ggaggatcat ccagccggcg tcccggaaaa cgattccgaa gcccaacctt tcatagaagg 10800 

cggcggtgga atcgaaatct cgtgatggca ggttgggcgt cgcttggtcg gtcatttcga 10860 

accccagagt cccgctcaga agaactcgtc aagaaggcga tagaaggcga tgcgctgcga 10920 

atcgggagcg gcgataccgt aaagcacgag gaagcggtca gcccattcgc cgccaagctc 10980 

ttcagcaata tcacgggtag ccaacgctat gtcctgatag cggtccgcca cacccagccg 11040 

gccacagtcg atgaatccag aaaagcggcc attttccacc atgatattcg gcaagcaggc 11100 

atcgccatgg gtcacgacga gatcatcgcc gtcgggcatg cgcgccttga gcctggcgaa 11160 

cagttcggct ggcgcgagcc cctgatgctc ttcgtccaga tcatcctgat cgacaagacc 11220 

ggcttccatc cgagtacgtg ctcgctcgat gcgatgtttc gcttggtggt cgaatgggca 11280 

ggtagccgga tcaagcgtat gcagccgccg cattgcatca gccatgatgg atactttctc 11340 

ggcaggagca aggtgagafg acaggagatc ctgccccggc acttcgccca atagcagcca 11400' 

gtcccttccc gcttcagtga caacgtcgag cacagctgcg caaggaacgc ccgtcgtggc 11460 

cagccacgat agccgcgctg cctcgtcctg cagttcattc agggcaccgg acaggtcggt 11520 

cttgacaaaa agaaccgggc gcccctgcgc tgacagccgg aacacggcgg catcagagca 11580 

gccgattgtc tgttgtgccc agtcatagcc gaatagcctc tccacccaag cggccggaga 11640 

acctgcgtgc aatccatctt gttcaatcat gcgaaacgat ccagatccgg tgcagattat 11700 

ttggattgag agtgaatatg agactctaat tggataccga ggggaattta tggaacgtca 11760 

gtggagcatt tttgacaaga aatatttgct agctgatagt gaccttaggc gacttttgaa 11820 

cgcgcaataa tggtttctga cgtatgtgct tagctcatta aactccagaa acccgcggct 11880 
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gagtggctcc ttcaacgttg cggttctgtc agttccaaac gtaaaacggc ttgtcccgcg 11940 

tcatcggcgg gggtcataac gtgactccct taattctccg ctcatgatca gattgtcgtt 12000 

tcccgccttc agtttaaact atcagtgttt gacaggatat attggcgggt aaacctaaga 12060 

gaaaagagcg tttattagaa taatcggata tttaaaaggg cgtgaaaagg tttatccgtt 12120 

cgtccatttg tatgtgcatg ccaaccacag ggttccccag atctggcgcc ggccagcgag 12180 

acgagcaaga ttggccgccg cccgaaacga tccgacagcg cgcccagcac aggtgcgcag 12240 

gcaaattgca ccaacgcata cagcgccagc agaatgccat agtgggcggt gacgtcgttc 12300 

gagtgaacca gatcgcgcag gaggcccggc agcaccggca taatcaggcc gatgccgaca 12360 

gcgtcgagcg cgacagtgct cagaattacg atcaggggta tgttgggttt cacgtctggc 12420 

ctccggacca gcctccgctg gtccgattga acgcgcggat tctttatcac tgataagttg 124 80 

gtggacatat tatgtttatc agtgataaag tgtcaagcat gac.aaagttg cagccgaata 12540 

cagtgatccg tgccgccctg gacctgttga acgaggtcgg cgtagacggt ctgacgacac 12600 

gcaaactggc ggaacggttg ggggttcagc agccggcgct ttactggcac ttcaggaaca 12660 

agcgggcgct gctcgacgca ctggccgaag ccatgctggc ggagaatcat acgcattcgg 12720 

tgccgagagc cgacgacgac tggcgctcat ttctgatcgg gaatgcccgc agcttcaggc 12780 

aggcgctgct cgcctaccgc gatggcgcgc gcatccatgc cggcacgcga ccgggcgcac 12840 

cgcagatgga aacggccgac gcgcagcttc gcttcctctg cgaggcgggt ttttcggccg 12900 

gggacgccgt caatgcgctg atgacaatca gctacttcac tgttggggcc gtgcttgagg 12960 

agcaggccgg cgacagcgat gccggcgagc gcggcggcac cgttgaacag gctccgctct 13020 

cgccgctgtt gcgggccgcg atagacgcct tcgacgaagc cggtccggac gcagcgttcg 13080 
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agcagggact cgcggtgatt gtcgatggat tggcgaaaag gaggctcgtt gtcaggaacg 13140 

ttgaaggacc gagaaagggt gacgattgat caggaccgct gccggagcgc aacccactca 13200 

ctacagcaga gccatgtaga caacatcccc tccccctttc caccgcgtca gacgcccgta 13260 

gcagcccgct acgggctttt tcatgccctg ccctagcgtc caagcctcac ggccgcgctc 13320 

ggcctctctg gcggccttct ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc 13380 

ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac 13440 

agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa 13500 

ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg acgagcatca 13560 

caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc 13620 

gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata 13680 

cctgtccgcc tttctccctt cgggaagcgt ggcgcttttc cgctgcataa ccctgcttcg 13740 

gggtcattat agcgattttt tcggtatatc catccttttt cgcacgatat acaggatttt 13800 

gccaaagggt tcgtgtagac tttccttggt gtatccaacg gcgtcagccg ggcaggatag 13860 

gtgaagtagg cccacccgcg agcgggtgtt ccttcttcac tgtcccttat tcgcacctgg 13920 

cggtgctcaa cgggaatcct gctctgcgag gctggccggc taccgccggc gtaacagatg 13980 

agggcaagcg gatggctgat gaaaccaagc caaccaggaa gggcagccca cctatcaagg 14040 

tgtactgcct tccagacgaa cgaagagcga ttgaggaaaa ggcggcggcg gccggcatga 14100 

gcctgtcggc ctacctgctg gccgtcggcc agggctacaa aatcacgggc gtcgtggact 14160 

atgagcacgt ccgcgagctg gcccgcatca atggcgacct gggccgcctg ggcggcctgc 14220 

tgaaactctg gctcaccgac gacccgcgca cggcgcggtt cggtgatgcc acgatcctcg 14280 

ccctgctggc gaagatcgaa gagaagcagg acgagcttgg caaggtcatg atgggcgtgg 14340 



BASF AG 246/365 January 08, 2004 

BASF NAE 877/03 

tccgcccgag ggcagagcca tgactttttt agccgctaaa acggccgggg ggtgcgcgtg 14400 

attgccaagc acgtccccat gcgctccatc aagaagagcg acttcgcgga gctggtgaag 14460 

tacatcaccg acgagcaagg caagaccgag cgcctttgcg acgctcaccg ggctggttgc 14520 

cctcgccgct gggctggcgg ccgtctatgg ccctgcaaac gcgccagaaa cgccgtcgaa 14580 

gccgtgtgcg agacaccgcg gccgccggcg ttgtggatac ctcgcggaaa acttggccct 14640 

cactgacaga tgaggggcgg acgttgacac ttgaggggcc gactcacccg gcgcggcgtt 14700 

gacagatgag gggcaggctc gatttcggcc ggcgacgtgg agctggccag cctcgcaaat 14760 

cggcgaaaac gcctgatttt acgcgagttt cccacagatg atgtggacaa gcctggggat 14820 

aagtgccctg cggtattgac acttgagggg cgcgactact gacagatgag gggcgcgatc 14880 

cttgacactt gaggggcaga gtgctgacag atgaggggcg cacctattga catttgaggg 14940 

gctgtccaca ggcagaaaat ccagcatttg caagggtttc cgcccgtttt tcggccaccg 15000 

ctaacctgtc ttttaacctg cttttaaacc aatatttata aaccttgttt ttaaccaggg 15060 

ctgcgccctg tgcgcgtgac cgcgcacgcc gaaggggggt gccccccctt ctcgaaccct 15120 

cccggcccgc taacgcgggc ctcccatccc cccaggggct gcgcccctcg gccgcgaacg 15180 

gcctcacccc aaaaatggca gcgctggcag tccttgccat tgccgggatc ggggcagtaa 15240 

cgggatgggc gatcagcccg agcgcgacgc ccggaagcat tgacgtgccg caggtgctgg 15300 

catcgacatt cagcgaccag gtgccgggca gtgagggcgg cggcctgggt ggcggcctgc 15360 

ccttcacttc ggccgtcggg gcattcacgg acttcatggc ggggccggca atttttacct 15420 

tgggcattct tggcatagtg gtcgcgggtg ccgtgctcgt gttcgggggt gcgataaacc ' 15480 

cagcgaacca tttgaggtga taggtaagat tataccgagg tatgaaaacg agaattggac 15540 
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ctttacagaa ttactctatg aagcgccata tttaaaaagc taccaagacg aagaggatga 15600 

agaggatgag gaggcagatt gccttgaata tattgacaat actgataaga taatatatct 15660 

tttatataga agatatcgcc gtatgtaagg atttcagggg gcaaggcata ggcagcgcgc 15720 

ttatcaatat atctatagaa tgggcaaagc ataaaaactt gcatggacta atgcttgaaa 15780 

cccaggacaa taaccttata gcttgtaaat tctatcataa ttgggtaatg actccaactt 15840 

attgatagtg ttttatgttc agataatgcc cgatgacttt gtcatgcagc tccaccgatt 15900 

ttgagaacga cagcgacttc cgtcccagcc gtgccaggtg ctgcctcaga ttcaggttat 15960 

gccgctcaat tcgctgcgta tatcgcttgc tgattacgtg cagctttccc ttcaggcggg 16020 

attcatacag cggccagcca tccgtcatcc atatcaccac gtcaaagggt gacagcaggc 16080 

tcataagacg ccccagcgtc gccatagtgc gttcaccgaa tacgtgcgca acaaccgtct 16140 

tccggagact gtcatacgcg taaaacagcc agcgctggcg cgatttagcc ccgacatagc 16200 

cccactgttc gtccatttcc gcgcagacga tgacgtcact gcccggctgt atgcgcgagg 16260 

ttaccgactg cggcctgagt tttttaagtg acgtaaaatc gtgttgaggc caacgcccat 16320 

aatgcgggct gttgcccggc atccaacgcc attcatggcc atatcaatga ttttctggtg 16380 

cgtaccgggt tgagaagcgg tgtaagtgaa ctgcagttgc catgttttac ggcagtgaga 16440 

gcagagatag cgctgatgtc cggcggtgct tttgccgtta cgcaccaccc cgtcagtagc 16500 

tgaacaggag ggacagctga tagacacaga agccactgga gcacctcaaa aacaccatca 16560 

tacactaaat cagtaagttg gcagcatcac ccataattgt ggtttcaaaa tcggctccgt 16620 

cgatactatg ttatacgcca actttgaaaa caactttgaa aaagctgttt tctggtattt 16680 

aaggttttag aatgcaagga acagtgaatt ggagttcgtc ttgttataat tagcttcttg 16740 

gggtatcttt aaatactgta gaaaagagga aggaaataat aaatggctaa aatgagaata 16800 
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tcaccggaat tgaaaaaact gatcgaaaaa taccgctgcg taaaagatac ggaaggaatg 16860 

tctcctgcta aggtatataa gctggtggga gaaaatgaaa acctatattt aaaaatgacg 16920 

gacagccggt ataaagggac cacctatgat gtggaacggg aaaaggacat gatgctatgg 16980 

ctggaaggaa agctgcctgt tccaaaggtc ctgcactttg aacggcatga tggctggagc 17040 

aatctgctca tgagtgaggc cgatggcgtc ctttgctcgg aagagtatga agatgaacaa 17100 

agccctgaaa agattatcga gctgtatgcg gagtgcatca ggctctttca ctccatcgac 17160 

atatcggatt gtccctatac gaatagctta gacagccgct tagccgaatt ggattactta 17220 

ctgaataacg atctggccga tgtggattgc gaaaactggg aagaagacac tccatttaaa 17280 

gatccgcgcg agctgtatga ttttttaaag acggaaaagc ccgaagagga acttgtcttt 17340 

tcccacggcg acctgggaga cagcaacatc tttgtgaaag atggcaaagt aagtggcttt 17400 

attgatcttg ggagaagcgg cagggcggac aagtggtatg acattgcctt ctgcgtccgg 174 60 

tcgatcaggg aggatatcgg ggaagaacag tatgtcgagc tattttttga cttactgggg 17520 

atcaagcctg attgggagaa aataaaatat tatattttac tggatgaatt gttttagtac 17580 

ctagatgtgg cgcaacgatg ccggcgacaa gcaggagcgc accgacttct tccgcatcaa 17640 

gtgttttggc tctcaggccg aggcccacgg caagtatttg ggcaaggggt cgctggtatt 17700 

cgtgcagggc aagattcgga ataccaagta cgagaaggac ggccagacgg tctacgggac 17760 

cgacttcatt gccgataagg tggattatct ggacaccaag gcaccaggcg ggtcaaatca 17820 

ggaataaggg cacattgccc cggcgtgagt cggggcaatc ccgcaaggag ggtgaatgaa 17880 

tcggacgttt gaccggaagg catacaggca agaactgatc gacgcggggt tttccgccga 17940 

ggatgccgaa accatcgcaa gccgcaccgt catgcgtgcg ccccgcgaaa ccttccagtc 18000 
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cgtcggctcg atggtccagc aagctacggc caagatcgag cgcgacagcg tgcaactggc 18060 

tccccctgcc ctgcccgcgc catcggccgc cgtggagcgt tcgcgtcgtc tcgaacagga 18120 

ggcggcaggt ttggcgaagt cgatgaccat cgacacgcga ggaactatga cgaccaagaa 18180 

gcgaaaaacc gccggcgagg acctggcaaa acaggtcagc gaggccaagc aggccgcgtt 18240 

gctgaaacac acgaagcagc agatcaagga aatgcagctt tccttgttcg atattgcgcc 18300 

gtggccggac acgatgcgag cgatgccaaa cgacacggcc cgctctgccc tgttcaccac 18360 

gcgcaacaag aaaatcccgc gcgaggcgct gcaaaacaag gtcattttcc acgtcaacaa 18420 

ggacgtgaag atcacctaca ccggcgtcga gctgcgggcc gacgatgacg aactggtgtg 184 80 

gcagcaggtg ttggagtacg cgaagcgcac ccctatcggc gagccgatca ccttcacgtt 18540 

ctacgagctt tgccaggacc tgggctggtc gatcaatggc cggtattaca cgaaggccga 18600 

ggaatgcctg tcgcgcctac aggcgacggc gatgggcttc acgtccgacc gcgttgggca 18660 

cctggaatcg gtgtcgctgc tgcaccgctt ccgcgtcctg gaccgtggca agaaaacgtc 18720 

ccgttgccag gtcctgatcrg acgaggaaat cgtcgtgctg tttgctggcg accactacac 18780* 

gaaattcata tgggagaagt accgcaagct gtcgccgacg gcccgacgga tgttcgacta 18840 

tttcagctcg caccgggagc cgtacccgct caagctggaa accttccgcc tcatgtgcgg 18900 

atcggattcc acccgcgtga agaagtggcg cgagcaggtc ggcgaagcct gcgaagagtt 18960 

gcgaggcagc ggcctggtgg aacacgcctg ggtcaatgat gacctggtgc attgcaaacg 19020 

ctagggcctt gtggggtcag ttccggctgg gggttcagca gccagcgctt tactggcatt 19080 

tcaggaacaa gcgggcactg ctcgacgcac ttgcttcgct cagtatcgct cgggacgcac 19140 

ggcgcgctct acgaactgcc gataaacaga ggattaaaat tgacaattgt gattaaggct 19200 

cagattcgac ggcttggagc ggccgacgtg caggatttcc gcgagatccg attgtcggcc 19260 
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ctgaagaaag ctccagagat gttcgggtcc gtttacgagc acgaggagaa aaagcccatg 19320 

gaggcgttcg ctgaacggtt gcgagatgcc gtggcattcg gcgcctacat cgacggcgag 19380 

atcattgggc tgtcggtctt caaacaggag gacggcccca aggacgctca caaggcgcat 19440 

ctgtccggcg ttttcgtgga gcccgaacag cgaggccgag gggtcgccgg tatgctgctg 19500 

cgggcgttgc cggcgggttt attgctcgtg atgatcgtcc gacagattcc aacgggaatc 19560 

tggtggatgc gcatcttcat cctcggcgca cttaatattt cgctattctg gagcttgttg 19620 

tttatttcgg tctaccgcct gccgggcggg gtcgcggcga cggtaggcgc tgtgcagccg 19680 

ctgatggtcg tgttcatctc tgccgctctg ctaggtagcc cgatacgatt gatggcggtc 19740 

ctgggggcta tttgcggaac tgcgggcgtg gcgctgttgg tgttgacacc aaacgcagcg 19800 

ctagatcctg tcggcgtcgc agcgggcctg gcgggggcgg tttccatggc gttcggaacc 19860 

gtgctgaccc gcaagtggca acctcccgtg cctctgctca cctttaccgc ctggcaactg 19920 

gcggccggag gacttctgct cgttccagta gctttagtgt ttgatccgcc aatcccgatg 19980 

cctacaggaa ccaatgttct cggcctggcg tggctcggcc tgatcggagc gggtttaacc 20040 

tacttccttt ggttccgggg gatctcgcga ctcgaaccta cagttgtttc cttactgggc 20100 

tttctcagcc ccagatctgg ggtcgatcag ccggggatgc atcaggccga cagtcggaac 20160 

ttcgggtccc cgacctgtac cattcggtga gcaatggata ggggagttga tatcgtcaac 20220 

gttcacttct aaagaaatag cgccactcag cttcctcagc ggctttatcc agcgatttcc 20280 

tattatgtcg gcatagttct caagatcgac agcctgtcac ggttaagcga gaaatgaata 20340 

agaaggctga taattcggat ctctgcgagg gagatgatat ttgatcacag gcagcaacgc 20400 

tctgtcatcg ttacaatcaa catgctaccc tccgcgagat catccgtgtt tcaaacccgg 20460 
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cagcttagtt gccgttcttc cgaatagcat cggtaacatg agcaaagtct gccgccttac 20520 

aacggctctc ccgctgacgc cgtcccggac tgatgggctg cctgtatcga gtggtgattt 20580 

tgtgccgagc tgccggtcgg ggagctgttg gctggctggt ggcaggatat attgtggtgt 20640 

aaacaaattg acgcttagac aacttaataa cacattgcgg acgtttttaa tgtactgggg 20700 

tggtttttct tttcaccagt gagacgggca acagctgatt gcccttcacc gcctggccct 20760 

gagagagttg cagcaagcgg tccacgctgg tttgccccag caggcgaaaa tcctgtttga 20820 

tggtggttcc gaaatcggca aaatccctta taaatcaaaa gaatagcccg agatagggtt 20880 

gagtgttgtt ccagtttgga acaagagtcc actattaaag aacgtggact ccaacgtcaa 20940 

agggcgaaaa accgtctatc agggcgatgg cccactacgt gaaccatcac ccaaatcaag 21000 

ttttttgggg tcgaggtgcc gtaaagcact aaatcggaac cctaaaggga gcccccgatt 21060 

tagagcttga cggggaaagc cggcgaacgt ggcgagaaag gaagggaaga aagcgaaagg 21120 

agcgggcgcc attcaggctg cgcaactgtt gggaagggcg atcggtgcgg gcctcttcgc 21180 

tattacgcca gctggcgaaa gggggatgtg ctgcaaggcg attaagttgg gtaacgccag 21240 

ggttttccca gtcacgacgt tgtaaaacga cggccagtga attcgagctc ggtacccggg 21300 

<210> 47 

<211> 17756 

<212> DNA 

<213> Artificial 

<220> 

<223> Plasmid 
<220> 

<221> misc_f eature 

<222> (10264) . . (10264) 
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<223> n is a, c, g, or t 
<220> 

<221> misc_f eature 

<222> (10472) . . (10472) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (10563) . . (10563) 

<223> n is a, c, g, or t 

<400> 47 

ccgggctggt tgccctcgcc gctgggctgg cggccgtcta tggccctgca aacgcgccag 60 

aaacgccgtc gaagccgtgt gcgagacacc gcggccgccg gcgttgtgga tacctcgcgg 120 

aaaacttggc cctcactgac agatgagggg cggacgttga cacttgaggg gccgactcac 180 

ccggcgcggc gttgacagat gaggggcagg ctcgatttcg gccggcgacg tggagctggc 240 

cagcctcgca aatcggcgaa aacgcctgat tttacgcgag tttcccacag atgatgtgga 300 

caagcctggg gataagtgcc ctgcggtatt gacacttgag gggcgcgact actgacagat 360 

gaggggcgcg atccttgaca cttgaggggc agagtgctga cagatgaggg gcgcacctat 420 

tgacatttga ggggctgtcc acaggcagaa aatccagcat ttgcaagggt ttccgcccgt 480 

ttttcggcca ccgctaacct gtcttttaac ctgcttttaa accaatattt ataaaccttg 540 

tttttaacca gggctgcgcc ctgtgcgcgt gaccgcgcac gccgaagggg ggtgcccccc 600 

cttctcgaac cctcccggcc cgctaacgcg ggcctcccat ccccccaggg gctgcgcccc 660 

tcggccgcga acggcctcac cccaaaaatg gcagcgctgg cagtccttgc cattgccggg 720 

atcggggcag taacgggatg ggcgatcagc ccgagcgcga cgcccggaag cattgacgtg- 780 



ccgcaggtgc tggcatcgac attcagcgac caggtgccgg gcagtgaggg cggcggcctg 840 
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ggtggcggcc tgcccttcac ttcggccgtc ggggcattca cggacttcat ggcggggccg 
gcaattttta ccttgggcat tcttggcata gtggtcgcgg gtgccgtgct cgtgttcggg 
ggtgcgataa acccagcgaa ccatttgagg tgataggtaa gattataccg aggtatgaaa 
acgagaattg gacctttaca gaattactct atgaagcgcc atatttaaaa agctaccaag 
acgaagagga tgaagaggat gaggaggcag attgccttga atatattgac aatactgata 
agataatata tcttttatat agaagatatc gccgtatgta aggatttcag ggggcaaggc 
ataggcagcg cgcttatcaa tatatctata gaatgggcaa agcataaaaa cttgcatgga 
ctaatgcttg aaacccagga caataacctt atagcttgta aattctatca taattgggta 
atgactccaa cttattgata gtgttttatg ttcagataat gcccgatgac tttgtcatgc 
agctccaccg attttgagaa cgacagcgac ttccgtccca gccgtgccag gtgctgcctc 
agattcaggt tatgccgctc aattcgctgc gtatatcgct tgctgattac gtgcagcttt 
cccttcaggc gggattcata cagcggccag ccatccgtca tccatatcac cacgtcaaag 
ggtgacagca ggctcataag acgccccagc gtcgccatag tgcgttcacc gaatacgtgc 
gcaacaaccg tcttccggag actgtcatac gcgtaaaaca gccagcgctg gcgcgattta 
gccccgacat agccccactg ttcgtccatt tccgcgcaga cgatgacgtc actgcccggc 
tgtatgcgcg aggttaccga ctgcggcctg agttttttaa gtgacgtaaa atcgtgttga 
ggccaacgcc cataatgcgg gctgttgccc ggcatccaac gccattcatg gccatatcaa 
tgattttctg gtgcgtaccg ggttgagaag cggtgtaagt gaactgcagt tgccatgttt 
tacggcagtg agagcagaga tagcgctgat gtccggcggt gcttttgccg ttacgcacca 
ccccgtcagt agctgaacag gagggacagc tgatagacac agaagccact ggagcacctc 
aaaaacacca tcatacacta aatcagtaag ttggcagcat cacccataat tgtggtttca 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
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aaatcggctc cgtcgatact atgttatacg ccaactttga aaacaacttt gaaaaagctg 2160 

ttttctggta tttaaggttt tagaatgcaa ggaacagtga attggagttc gtcttgttat 2220 

aattagcttc ttggggtatc tttaaatact gtagaaaaga ggaaggaaat aataaatggc 2280 

taaaatgaga atatcaccgg aattgaaaaa actgatcgaa aaataccgct gcgtaaaaga 2340 

tacggaagga atgtctcctg ctaaggtata taagctggtg ggagaaaatg aaaacctata 2400 

tttaaaaatg acggacagcc ggtataaagg gaccacctat gatgtggaac gggaaaagga 24 60 

catgatgcta tggctggaag gaaagctgcc tgttccaaag gtcctgcact ttgaacggca 2520 

tgatggctgg agcaatctgc tcatgagtga ggccgatggc gtcctttgct cggaagagta 2580 

tgaagatgaa caaagccctg aaaagattat cgagctgtat gcggagtgca tcaggctctt 2640 

tcactccatc gacatatcgg attgtcccta tacgaatagc ttagacagcc gcttagccga 2700 

attggattac ttactgaata acgatctggc cgatgtggat tgcgaaaact gggaagaaga 2760 

cactccattt aaagatccgc gcgagctgta tgatttttta aagaeggaaa agcccgaaga 2820 

ggaacttgtc ttttcccacg gcgacctggg agacagcaac atctttgtga aagatggcaa 2880 

agtaagtggc tttattgatc ttgggagaag cggcagggcg gacaagtggt atgacattgc 2940 

cttctgcgtc cggtcgatca gggaggatat cggggaagaa cagtatgtcg agctattttt 3000 

tgacttactg gggatcaagc ctgattggga gaaaataaaa tattatattt tactggatga 3060 

attgttttag tacctagatg tggcgcaacg atgccggcga caagcaggag cgcaccgact 3120 

tcttccgcat caagtgtttt ggctctcagg ccgaggccca cggcaagtat ttgggcaagg 3180 

ggtcgctggt attcgtgcag ggcaagattc ggaataccaa gtacgagaag gacggccaga 3240 

cggtctacgg gaccgacttc attgccgata aggtggatta tctggacacc aaggcaccag 3300 
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gcgggtcaaa tcaggaataa gggcacattg ccccggcgtg agtcggggca atcccgcaag 
gagggtgaat gaatcggacg tttgaccgga aggcatacag gcaagaactg atcgacgcgg 
ggttttccgc cgaggatgcc gaaaccatcg caagccgcac cgtcatgcgt gcgccccgcg 
aaaccttcca gtccgtcggc tcgatggtcc agcaagctac ggccaagatc gagcgcgaca 
gcgtgcaact ggctccccct gccctgcccg cgccatcggc cgccgtggag cgttcgcgtc 
gtctcgaaca ggaggcggca ggtttggcga agtcgatgac catcgacacg cgaggaacta 
tgacgaccaa gaagcgaaaa accgccggcg aggacctggc aaaacaggtc agcgaggcca 
agcaggccgc gttgctgaaa cacacgaagc agcagatcaa ggaaatgcag ctttccttgt 
tcgatattgc gccgtggccg gacacgatgc gagcgatgcc aaacgacacg gcccgctctg 
ccctgttcac cacgcgcaac aagaaaatcc cgcgcgaggc gctgcaaaac aaggtcattt 
tccacgtcaa paaggacgtg aagatcacct acaccggcgt cgagctgcgg gccgacgatg 
acgaactggt gtggcagcag gtgttggagt acgcgaagcg cacccctatc ggcgagccga 
tcaccttcac gttctacgag ctttgccagg acctgggctg gtcgatcaat ggccggtatt 
acacgaaggc cgaggaatgc ctgtcgcgcc tacaggcgac ggcgatgggc ttcacgtccg 
accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc ctggaccgtg 
gcaagaaaac gtcccgttgc caggtcctga tcgacgagga aatcgtcgtg ctgtttgctg 
gcgaccacta cacgaaattc atatgggaga agtaccgcaa gctgtcgccg acggcccgac 
ggatgttcga ctatttcagc tcgcaccggg agccgtaccc gctcaagctg gaaaccttcc 
gcctcatgtg cggatcggat tccacccgcg tgaagaagtg gcgcgagcag gtcggcgaag 
cctgcgaaga gttgcgaggc agcggcctgg tggaacacgc ctgggtcaat gatgacctgg 
tgcattgcaa acgctagggc cttgtggggt cagttccggc tgggggttca gcagccagcg 



3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
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ctttactggc atttcaggaa caagcgggca ctgctcgacg cacttgcttc gctcagtatc 4 620 

gctcgggacg cacggcgcgc tctacgaact gccgataaac agaggattaa aattgacaat 4 680 

tgtgattaag gctcagattc gacggcttgg agcggccgac gtgcaggatt tccgcgagat 474 0 

ccgattgtcg gccctgaaga aagctccaga gatgttcggg tccgtttacg agcacgagga 4800 

gaaaaagccc atggaggcgt tcgctgaacg gttgcgagat gccgtggcat tcggcgccta 4860 

catcgacggc gagatcattg ggctgtcggt cttcaaacag gaggacggcc ccaaggacgc 4920 

tcacaaggcg catctgtccg gcgttttcgt ggagcccgaa cagcgaggcc gaggggtcgc 4 980 

cggtatgctg ctgcgggcgt tgccggcggg tttattgctc gtgatgatcg tccgacagat 5040 

tccaacggga atctggtgga tgcgcatctt catcctcggc gcacttaata tttcgctatt 5100 

ctggagcttg ttgtttattt cggtctaccg cctgccgggc ggggtcgcgg cgacggtagg 5160 

cgctgtgcag ccgctgatgg tcgtgttcat ctctgccgct ctgctaggta gcccgatacg 5220 

attgatggcg gtcctggggg ctatttgcgg aactgcgggc gtggcgctgt tggtgttgac 5280 

accaaacgca gcgctagatc ctgtcggcgt cgcagcgggc ctggcggggg cggtttccat 5340 

ggcgttcgga accgtgctga cccgcaagtg gcaacctccc gtgcctctgc tcacctttac 5400 

cgcctggcaa ctggcggccg gaggacttct gctcgttcca gtagctttag tgtttgatcc 54 60 

gccaatcccg atgcctacag gaaccaatgt tctcggcctg gcgtggctcg gcctgatcgg 5520 

agcgggttta acctacttcc tttggttccg ggggatctcg cgactcgaac ctacagttgt 5580 

ttccttactg ggctttctca gccccagatc tggggtcgat cagccgggga tgcatcaggc 5640 

cgacagtcgg aacttcgggt ccccgacctg taccattcgg tgagcaatgg ataggggagt 5700 

tgatatcgtc aacgttcact tctaaagaaa tagcgccact cagcttcctc agcggcttta 5760 
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tccagcgatt tcctattatg tcggcatagt tctcaagatc gacagcctgt cacggttaag 5820 

cgagaaatga ataagaaggc tgataattcg gatctctgcg agggagatga tatttgatca 5880 

caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga gatcatccgt 5940 

gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac atgagcaaag 6000 

tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg ctgcctgtat 6060 

cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct ggtggcagga 6120 

tatattgtgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg cggacgtttt 6180 

taatgtactg gggtggtttt tcttttcacc agtgagacgg gcaacagctg attgcccttc 6240 

accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtttgccc cagcaggcga 6300 

aaatcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca aaagaatagc 6360 

ccgagatagg gttgagtgtt gttccagttt ggaacaagag tccactatta aagaacgtgg 6420 

actccaacgt caaagggcga aaaaccgtct atcagggcga tggcccacta cgtgaaccat 6480 

cacccaaatc aagttttttg gggtcgaggt gccgtaaagc actaaatcgg aaccctaaag 6540 

ggagcccccg atttagagct tgacggggaa agccggcgaa cgtggcgaga aaggaaggga 6600 

agaaagcgaa aggagcgggc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg 6660 

cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt 6720 

tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattcgag 6780 

ctcggtaccc ggggatcttt cgacactgaa atacgtcgag cctgctccgc ttggaagcgg 6840 

cgaggagcct cgtcctgtca caactaccaa catggagtac gataagggcc agttccgcca 6900 

gctcattaag agccagttca tgggcgttgg catgatggcc gtcatgcatc tgtacttcaa 6960 

gtacaccaac gctcttctga tccagtcgat catccgctga aggcgctttc gaatctggtt 7020 
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aagatccacg tcttcgggaa gccagcgact ggtgacctcc agcgtccctt taaggctgcc 
aacagctttc tcagccaggg ccagcccaag accgacaagg cctccctcca gaacgccgag 
aagaactgga ggggtggtgt caaggaggag taagctcctt attgaagtcg gaggacggag 
cggtgtcaag aggatattct tcgactctgt attatagata agatgatgag gaattggagg 
tagcatagct tcatttggat ttgctttcca ggctgagact ctagcttgga gcatagaggg 
tcctttggct ttcaatattc tcaagtatct cgagtttgaa cttattccct gtgaaccttt 
tattcaccaa tgagcattgg aatgaacatg aatctgagga ctgcaatcgc catgaggttt 
-tcgaaataca tccggatgtc gaaggcttgg ggcacctgcg ttggttgaat ttagaacgtg 
gcactattga tcatccgata gctctgcaaa gggcgttgca caatgcaagt caaacgttgc 
tagcagttcc aggtggaatg ttatgatgag cattgtatta aatcaggaga tatagcatga 
tctctagtta gctcaccaca aaagtcagac ggcgtaacca aaagtcacac aacacaagct 
gtaaggattt cggcacggct acggaagacg gagaagccac cttcagtgga ctcgagtacc 
atttaattct atttgtgttt gatcgagacc taatacagcc cctacaacga ccatcaaagt 
cgtatagcta ccagtgagga agtggactca aatcgacttc agcaacatct cctggataaa 
ctttaagcct aaactataca gaataagata ggtggagagc ttataccgag ctcccaaatc 
tgtccagatc atggttgacc ggtgcctgga tcttcctata gaatcatcct tattcgttga 
cctagctgat tctggagtga cccagagggt catgacttga gcctaaaatc cgccgcctcc 
accatttgta gaaaaatgtg acgaactcgt gagctctgta cagtgaccgg tgactctttc 
tggcatgcgg agagacggac ggacgcagag agaagggctg agtaataagc cactggccag- 
acagctctgg cggctctgag gtgcagtgga tgattattaa tccgggaccg gccgcccctc 



7080 
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cgccccgaag tggaaaggct ggtgtgcccc tcgttgacca agaatctatt gcatcatcgg 8280 

agaatatgga gcttcatcga atcaccggca gtaagcgaag gagaatgtga agccaggggt 8340 

gtatagccgt cggcgaaata gcatgccatt aacctaggta cagaagtcca attgcttccg 8400 

atctggtaaa agattcacga gatagtacct tctccgaagt aggtagagcg agtacccggc 84 60 

gcgtaagctc cctaattggc ccatccggca tctgtagggc gtccaaatat cgtgcctctc 8520 

ctgctttgcc cggtgtatga aaccggaaag gccgctcagg agctggccag cggcgcagac 8580 

cgggaacaca agctggcagt cgacccatcc ggtgctctgc actcgacctg ctgaggtccc 8640 

tcagtccctg gtaggcagct ttgccccgtc tgtccgcccg gtgtgtcggc ggggttgaca 8700 

aggtcgttgc gtcagtccaa catttgttgc catattttcc tgctctcccc accagctgct 8760 

cttttctttt ctctttcttt tcccatcctc agtatattca tcttcccatc caagaacctt 8820 

tatttcccct aagtaagtac tttgctacat ccatactcca tccttcccat cccttattcc 8880 

tttgaacctt tcagttcgag ctttcccact tcatcgcagc ttgactaaca gctaccccgc 8940 

ttgagcagac atcaccatgc ctgaactcac cgcgacgtct gtcgagaagt ttctgatcga 9000 

aaagttcgac agcgtctccg acctgatgca gctctcggag ggcgaagaat ctcgtgcttt 9060 

cagcttcgat gtaggagggc gtggatatgt cctgcgggta aatagctgcg ccgatggttt 9120 

ctacaaagat cgttatgttt atcggcactt tgcatcggcc gcgctcccga ttccggaagt 9180 

gcttgacatt ggggaattca gcgagagcct gacctattgc atctcccgcc gtgcacaggg 9240 

tgtcacgttg caagacctgc ctgaaaccga actgcccgct gttctgcagc cggtcgcgga 9300 

ggccatggat gcgatcgctg cggccgatct tagccagacg agcgggttcg gcccattcgg 9360 

accgcaagga atcggtcaat acactacatg gcgtgatttc atatgcgcga ttgctgatcc 9420 

ccatgtgtat cactggcaaa ctgtgatgga cgacaccgtc agtgcgtccg tcgcgcaggc 9480 
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tctcgatgag ctgatgcttt gggccgagga 
ggatttcggc tccaacaatg tcctgacgga 
gagcgaggcg atgttcgggg attcccaata 
gtggttggct tgtatggagc agcagacgcg 
aggatcgccg cggctccggg cgtatatgct 
cttggttgac ggcaatttcg atgatgcagc 
ccgatccgga gccgggactg tcgggcgtac 
gaccgatggc tgtgtagaag tactcgccga 
gagggcaaag gaatagagta gatgccgacc 
tcatcaaaca gcttgacgaa tctggatata 
ttgagacaaa tggtgttcag gatctcgata 
gtgccttcta gtgatttaat agctccatgt 
cctcttccag atacagctca tctgcaatgc 
cttncaggct ccggcgaaga gaagaatagc 
gagatcaagc agatcaacgg tcgtcaagag 
tccacgcgac tatatatttg tctctaattg 
atagcttgac tatgaaaatt ccgtcaccag 
tcttccttga actctcaagc ctacaggaca 
canttcctac taagatggta tacaatagta 
taacacccaa tacgccggcc gaaacttttt 
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ctgccccgaa gtccggcacc tcgtgcacgc 9540 

caatggccgc ataacagcgg tcattgactg 9600 

cgaggtcgcc aacatcttct tctggaggcc 9660 

ctacttcgag cggaggcatc cggagcttgc 9720 

ccgcattggt cttgaccaac tctatcagag 9780 

ttgggcgcag ggtcgatgcg acgcaatcgt 9840 

acaaatcgcc cgcagaagcg cggccgtctg 9900 

tagtggaaac cgacgcccca gcactcgtcc 9960 

gcgggatcga tccacttaac gttactgaaa 10020 

agatcgttgg tgtcgatgtc agctccggag 10080 

agatacgttc atttgtccaa gcagcaaaga 10140 

caacaagaat aaaacgcgtt ttcgggttta 10200 

attaatgcat tgactgcaac ctagtaacgc 10260 

ttagcagagc tattttcatt ttcgggagac 10320 

acctacgaga ctgaggaatc cgctcttggc 10380 

tactttgaca tgctcctctt ctttactctg 10440 

cncctgggtt cgcaaagata attgcatgtt 10500 

cacattcatc gtaggtataa acctcgaaat 10560 

accatgcatg gttgcctagt gaatgctccg 10620 

tacaactctc ctatgagtcg tttacccaga 10680 
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atgcacaggt acacttgttt agaggtaatc cttctttcta gctagaagtc ctcgtgtact 10740 

gtgtaagcgc ccactccaca tctccactcg acctgcaggc atgcaagctt cattttgctt 10800 

tgtaaatttc tggtaactgc caccaagaaa tatgaggata ttcgtgatgt tcctcgtggt 10860 

agccaaaatg atagcacgtg ataaatgacc accaaatagg acggctaatt gtttgggcac 10920 

aatgaggctg aacataaccc cctattggtt cactatgggg taaaaaagta ccaaaataga 10980 

ataattgtaa tgaacttaaa agcgagggta gcacccaaaa gtaagttaga ttatcacttg 11040 

ggatatggag tatgtattta gcaaagttat aaataatagt caacgcaatt atttgccccc 11100 

aactccagta acctttcata aaatgaaaat accaagcaaa gaaactttgg tgtttaccat 11160 

tgtgaaaatc cgggtctatt gagcttgctg gattgtggtg gtgtaaccaa tgttttttca 11220 

atagtttttg atatggtaaa agaccataaa gggatagggt caatgttcca atcaaatgat 11280 

taatcttggt gttttgggga aatactacgc catgcatggc atcatgagat gtaataaata 11340 

atcccgtata taaaaatgtt tgccatagta taacaggcaa taacatccaa aattttagct 11400 

ttgagatgtc aagggaaagt aataaactca ggctaatgac ccatgcgcta acaatgacaa 11460 

tagcaatgaa aagcccctta aactgagatt tacttctcag tactggagtc agttttgctt 11520 

gatgactgag tggttgttct aactggatca tttctaaaga gaaggtggaa caatgttagc 11580 

ataattgtgc ttgagtgagg actttgaggg taggtacata cttgataaag ttaatgatta 11640 

aagagaaaaa aaaagttttg gttcaaagca gaaattgttt tttaaatcga ttggtgagaa 11700 

aatttttttc tgtttccgca tcaccaaagc cacctcagga atggtcacaa attattggtc 11760 

tgattggacc ataagcatac aaaaagttca ttgaagtata cttagtggct tattagactt 11820 

ttatcgtttt ctaacgcgaa tcagcaatgt ttcttgtttg atttactgct tgctttagat 11880 

catttttgtc tgaaatatta tgcatttgtt caaagcggcc tttgtttcct ttctttcatg 11940 
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cttaaacacg ttgtttattc catatattac tttgaatatg catcaccgca aagcggaagt 12000 

gcaaaataac aaagaacctc tttgggttac acgatcaact gctattgtga aaaaaatttc 12060 

tttttgaaaa tttttggaat aatatctctt gcaaaaaaga aattttgtat atttagtagc 12120 

atcaagaaca aatgaaagaa gtgtgggata acaagaatac atcatcttta gacaaaagta 12180 

cgagaaaaat ctaataagtt gttatagagg tctttgtttt ctttgtgttt atagacagtt 12240 

atttagagtt tgaaaagtgt ctctaatgtg tcttttttta ttattattat ttcaaatgtt 12300 

atgtaatata gctaaagcta tagatttgac attttttcta aatataaaat ttcagtcaac 12360 

agaaataaat gacacgagtt ctttttctct ctctcaatcc tgttgatcat caatctttga 12420 

tgtcgtttta aaacaaatga atggcattta gttccttagg tgtcactcac atcttgttga 12480 

ccagaaaatc cttattcgcc ctcaaatctg ctttattcct ttcatttgat ttgatgttta 12540 

agtaatgcaa gcaaacaaaa aagaaacctt tcttgcaaag acaaaagaat tgttttcaga 12600 

ggaaagcaac tcgttgtcat tttttaagga tttagactta taatcgacac catagtttgt 12660 

ccgttacatt ttttattgtc gttttctgat ttccttttaa tctttaagca aaatcaatat 12720 

taacttatct tgtcttccaa taaaaaatgg ataccaataa caataaatcc ttcacaaaga 12780 

aaaaaaaaaa aaactcgaaa aaagcttggc gtaatcatgg tcatagctgt ttcctgtgtg 12840 

aaattgttat ccgctcacaa ttccacacaa catacgagcc ggaagcataa agtgtaaagc 12900 

ctggggtgcc taatgagtga gctaactcac attaattgcg ttgcgctcac tgcccgcttt 12960 

ccagtcggga aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg cggggagagg 13020 

cggtttgcgt attgggccaa agacaaaagg gcgacattca accgattgag ggagggaagg 13080 

taaatattga cggaaattat tcattaaagg tgaattatca ccgtcaccga cttgagccat 13140 
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ttgggaatta gagccagcaa aatcaccagt agcaccatta ccattagcaa ggccggaaac 13200 

gtcaccaatg aaaccatcga tagcagcacc gtaatcagta gcgacagaat caagtttgcc 13260 

tttagcgtca gactgtagcg cgttttcatc ggcattttcg gtcatagccc ccttattagc 13320 

gtttgccatc ttttcataat caaaatcacc ggaaccagag ccaccaccgg aaccgcctcc 13380 

ctcagagccg ccaccctcag aaccgccacc ctcagagcca ccaccctcag agccgccacc 13440 

agaaccacca ccagagccgc cgccagcatt gacaggaggc ccgatctagt aacatagatg 13500 

acaccgcgcg cgataattta tcctagtttg cgcgctatat tttgttttct atcgcgtatt 13560 

aaatgtataa ttgcgggact ctaatcataa aaacccatct cataaataac gtcatgcatt 13620 

acatgttaat tattacatgc ttaacgtaat tcaacagaaa ttatatgata atcatcgcaa 13680 

gaccggcaac aggattcaat cttaagaaac tttattgcca aatgtttgaa cgatcgggga 13740 

tcatccgggt ctgtggcggg aactccacga aaatatccga acgcagcaag atatcgcggt 13800 

gcatctcggt cttgcctggg cagtcgccgc cgacgccgtt gatgtggacg ccgggcccga 13860 

tcatattgtc gctcaggatrc gtggcgttgt gcttgtcggc cgttgctgtc gtaatgatat 13920 

cggcaccttc gaccgcctgt tccgcagaga tcccgtgggc gaagaactcc agcatgagat 13980 

ccccgcgctg gaggatcatc cagccggcgt cccggaaaac gattccgaag cccaaccttt 14040 

catagaaggc ggcggtggaa tcgaaatctc gtgatggcag gttgggcgtc gcttggtcgg 14100 

tcatttcgaa ccccagagtc ccgctcagaa gaactcgtca agaaggcgat agaaggcgat 14160 

gcgctgcgaa tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc 14220 

gccaagctct tcagcaatat cacgggtagc caacgctatg tcctgatagc ggtccgccac 14280 

acccagccgg ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg 14340 

caagcaggca tcgccatggg tcacgacgag atcatcgccg tcgggcatgc gcgccttgag 14400 
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cctggcgaac agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc 
gacaagaccg gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc 
gaatgggcag gtagccggat caagcgtatg cagccgccgc attgcatcag ccatgatgga 
tactttctcg gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa 
tagcagccag tcccttcccg cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc 
cgtcgtggcc agccacgata gccgcgctgc ctcgtcctgc agttcattca gggcaccgga 
caggtcggtc ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga acacggcggc 
•atcagagcag ccgattgtct gttgtgccca gtcatagccg aatagcctct ccacccaagc 
ggccggagaa cctgcgtgca atccatcttg ttcaatcatg cgaaacgatc cagatccggt 
gcagattatt tggattgaga gtgaatatga gactctaatt ggataccgag gggaatttat 
ggaacgtcag tggagcattt ttgacaagaa atatttgcta gctgatagtg accttaggcg 
acttttgaac gcgcaataat ggtttctgac gtatgtgctt agctcattaa actccagaaa 
cccgcggctg agtggctcct tcaacgttgc ggttctgtca gttccaaacg taaaacggct 
tgtcccgcgt catcggcggg ggtcataacg tgactccctt aattctccgc tcatgatcag 
attgtcgttt cccgccttca gtttaaacta tcagtgtttg acaggatata ttggcgggta 
aacctaagag aaaagagcgt ttattagaat aatcggatat ttaaaagggc gtgaaaaggt 
ttatccgttc gtccatttgt atgtgcatgc caaccacagg gttccccaga tctggcgccg 
gccagcgaga cgagcaagat tggccgccgc ccgaaacgat ccgacagcgc gcccagcaca 
ggtgcgcagg caaattgcac caacgcatac agcgccagca gaatgccata gtgggcggtg ■ 
acgtcgttcg agtgaaccag atcgcgcagg aggcccggca gcaccggcat aatcaggccg 



14460 
14520 
14580 
14640 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
15540 
15600 
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atgccgacag cgtcgagcgc gacagtgctc agaattacga tcaggggtat gttgggtttc 15660 

acgtctggcc tccggaccag cctccgctgg tccgattgaa cgcgcggatt ctttatcact 15720 

gataagttgg tggacatatt atgtttatca gtgataaagt gtcaagcatg acaaagttgc 15780 

agccgaatac agtgatccgt gccgccctgg acctgttgaa cgaggtcggc gtagacggtc 15840 

tgacgacacg caaactggcg gaacggttgg gggttcagca gccggcgctt tactggcact 15900 

tcaggaacaa gcgggcgctg ctcgacgcac tggccgaagc catgctggeg gagaatcata 15960 

cgcattcggt gccgagagcc gacgacgact ggcgctcatt tctgatcggg aatgcccgca 16020 

gcttcaggca ggcgctgctc gcctaccgcg atggcgcgcg catccatgcc ggcacgcgac 16080 

cgggcgcacc gcagatggaa acggccgacg cgcagcttcg cttcctctgc gaggcgggtt 16140 

tttcggccgg ggacgccgtc aatgcgctga tgacaatcag ctacttcact gttggggccg 16200 

tgcttgagga gcaggccggc gacagcgatg ccggcgagcg cggcggcacc gttgaacagg 16260 

ctccgctctc gccgctgttg cgggccgcga tagacgcctt cgacgaagcc ggtccggacg 16320 

9 

cagcgttcga gcagggactc gcggtgattg tcgatggatt ggcgaaaagg aggctcgttg 16380 

tcaggaacgt tgaaggaccg agaaagggtg acgattgatc aggaccgctg ccggagcgca 16440 

acccactcac tacagcagag ccatgtagac aacatcccct ccccctttcc accgcgtcag 16500 

acgcccgtag cagcccgcta cgggcttttt catgccctgc cctagcgtcc aagcctcacg 16560 

gccgcgctcg gcctctctgg cggccttctg gcgctcttcc gcttcctcgc tcactgactc 16620 

gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct cactcaaagg cggtaatacg 16680 

gttatccaca gaatcagggg ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa 16740 

ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc cataggctcc gcccccctga 16800 

cgagcatcac aaaaatcgac gctcaagtca gaggtggcga aacccgacag gactataaag 16860 
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ataccaggcg tttccccctg gaagctccct cgtgcgctct cctgttccga ccctgccgct 16920 

taccggatac ctgtccgcct ttctcccttc gggaagcgtg gcgcttttcc gctgcataac 16980 

cctgcttcgg ggtcattata gcgatttttt cggtatatcc atcctttttc gcacgatata 17040 

caggattttg ccaaagggtt cgtgtagact ttccttggtg tatccaacgg cgtcagccgg 17100 

gcaggatagg tgaagtaggc ccacccgcga gcgggtgttc cttcttcact gtcccttatt 17160 

cgcacctggc ggtgctcaac gggaatcctg ctctgcgagg ctggccggct accgccggcg 17220 

taacagatga gggcaagcgg atggctgatg aaaccaagcc aaccaggaag ggcagcccac 17280 

ctatcaaggt gtactgcctt ccagacgaac gaagagcgat tgaggaaaag gcggcggcgg 17340 

ccggcatgag cctgtcggcc tacctgctgg ccgtcggcca gggctacaaa atcacgggcg 17400 

tcgtggacta tgagcacgtc cgcgagctgg cccgcatcaa tggcgacctg ggccgcctgg 174 60 

gcggcctgct gaaactctgg ctcaccgacg acccgcgcac ggcgcggttc ggtgatgcca 17520 

cgatcctcgc cctgctggcg aagatcgaag agaagcagga cgagcttggc aaggtcatga 17580 

tgggcgtggt ccgcccgagg gcagagccat gactttttta gccgctaaaa cggccggggg 17640 

gtgcgcgtga ttgccaagca cgtccccatg cgctccatca agaagagcga cttcgcggag 17700 

ctggtgaagt acatcaccga cgagcaaggc aagaccgagc gcctttgcga cgctca 17756 
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<220> 

<221> misc^feature 

<222> (10264) . . (10264) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (10472) . . (10472) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (10563) . . (10563) 

<223> n is a, c, q, or t 



<400> 48 

ccgggctggt tgccctcgcc gctgggctgg cggccgtcta tggccctgca aacgcgccag 



60 



aaacgccgtc gaagccgtgt gcgagacacc gcggccgccg gcgttgtgga tacctcgcgg 120 



aaaacttggc cctcactgac agatgagggg cggacgttga cacttgaggg gccgactcac 



180 



ccggcgcggc gttgacagat gaggggcagg ctcgatttcg gccggcgacg tggagctggc 240 

cagcctcgca aatcggcgaa aacgcctgat tttacgcgag tttcccacag atgatgtgga 300 

caagcctggg gataagtgcc ctgcggtatt gacacttgag gggcgcgact actgacagat 360 

gaggggcgcg atccttgaca cttgaggggc agagtgctga cagatgaggg gcgcacctat 420 

tgacatttga ggggctgtcc acaggcagaa aatccagcat ttgcaagggt ttccgcccgt 480 

ttttcggcca ccgctaacct gtcttttaac ctgcttttaa accaatattt ataaaccttg 540 



tttttaacca gggctgcgcc ctgtgcgcgt gaccgcgcac gccgaagggg ggtgcccccc 



600 



cttctcgaac cctcccggcc cgctaacgcg ggcctcccat ccccccaggg gctgcgcccc 660 
tcggccgcga acggcctcac cccaaaaatg gcagcgctgg cagtccttgc cattgccggg 720 



atcggggcag taacgggatg ggcgatcagc ccgagcgcga cgcccggaag cattgacgtg 780 
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ccgcaggtgc tggcatcgac attcagcgac caggtgccgg gcagtgaggg cggcggcctg 840 

ggtggcggcc tgcccttcac ttcggccgtc ggggcattca cggacttcat ggcggggccg 900 

gcaattttta ccttgggcat tcttggcata gtggtcgcgg gtgccgtgct cgtgttcggg 960 

ggtgcgataa acccagcgaa ccatttgagg tgataggtaa gattataccg aggtatgaaa 1020 

acgagaattg gacctttaca gaattactct atgaagcgcc atatttaaaa agctaccaag 1080 

acgaagagga tgaagaggat gaggaggcag attgccttga atatattgac aatactgata 1140 

agataatata tcttttatat agaagatatc gccgtatgta aggatttcag ggggcaaggc 1200 

ataggcagcg cgcttatcaa tatatctata gaatgggcaa agcataaaaa cttgcatgga 1260 

ctaatgcttg aaacccagga caataacctt atagcttgta aattctatca taattgggta 1320 

atgactccaa cttattgata gtgttttatg ttcagataat gcccgatgac tttgtcatgc 1380 

agctccaccg attttgagaa cgacagcgac ttccgtccca gccgtgccag gtgctgcctc 1440 

agattcaggt tatgccgctc aattcgctgc gtatatcgct tgctgattac gtgcagcttt 1500 

cccttcaggc gggattcata cagcggccag ccatccgtca tccatatcac cacgtcaaag 1560 

ggtgacagca ggctcataag acgccccagc gtcgccatag tgcgttcacc gaatacgtgc 1620 

gcaacaaccg tcttccggag actgtcatac gcgtaaaaca gccagcgctg gcgcgattta 1680 

gccccgacat agccccactg ttcgtccatt tccgcgcaga cgatgacgtc actgcccggc 1740 

tgtatgcgcg aggttaccga ctgcggcctg agttttttaa gtgacgtaaa atcgtgttga 1800 

ggccaacgcc cataatgcgg gctgttgccc ggcatccaac gccattcatg gccatatcaa 1860 

tgattttctg gtgcgtaccg ggttgagaag cggtgtaagt gaactgcagt tgccatgttt 1920 

tacggcagtg agagcagaga tagcgctgat gtccggcggt gcttttgccg ttacgcacca 1980 
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ccccgtcagt agctgaacag gagggacagc tgatagacac agaagccact ggagcacctc 2040 

aaaaacacca tcatacacta aatcagtaag ttggcagcat cacccataat tgtggtttca 2100 

aaatcggctc cgtcgatact atgttatacg ccaactttga aaacaacttt gaaaaagctg 2160 

ttttctggta tttaaggttt tagaatgcaa ggaacagtga attggagttc gtcttgttat 2220 

aattagcttc ttggggtatc tttaaatact gtagaaaaga ggaaggaaat aataaatggc 2280 

taaaatgaga atatcaccgg aattgaaaaa actgatcgaa aaataccgct gcgtaaaaga 2340 

tacggaagga atgtctcctg ctaaggtata taagctggtg ggagaaaatg aaaacctata 2400 

tttaaaaatg acggacagcc ggtataaagg gaccacctat gatgtggaac gggaaaagga 24 60 

catgatgcta tggctggaag gaaagctgcc tgttccaaag gtcctgcact ttgaacggca 2520 

tgatggctgg agcaatctgc tcatgagtga ggccgatggc gtcctttgct cggaagagta 2580 

tgaagatgaa caaagccctg aaaagattat cgagctgtat gcggagtgca tcaggctctt 2640 

tcactccatc gacatatcgg attgtcccta tacgaatagc ttagacagcc gcttagccga 2700 

attggattac ttactgaatra acgatctggc cgatgtggat tgcgaaaact gggaagaaga 2760 

cactccattt aaagatccgc gcgagctgta tgatttttta aagacggaaa agcccgaaga 2820 

ggaacttgtc ttttcccacg gcgacctggg agacagcaac atctttgtga aagatggcaa 2880 

agtaagtggc tttattgatc ttgggagaag cggcagggcg gacaagtggt atgacattgc 2940 

cttctgcgtc cggtcgatca gggaggatat cggggaagaa cagtatgtcg agctattttt 3000 

tgacttactg gggatcaagc ctgattggga gaaaataaaa tattatattt tactggatga 3060 

attgttttag tacctagatg tggcgcaacg atgccggcga caagcaggag cgcaccgact 3120 

tcttccgcat caagtgtttt ggctctcagg ccgaggccca cggcaagtat ttgggcaagg 3180 

ggtcgctggt attcgtgcag ggcaagattc ggaataccaa gtacgagaag gacggccaga 3240 
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cggtctacgg gaccgacttc attgccgata aggtggatta tctggacacc aaggcaccag 3300 

gcgggtcaaa tcaggaataa gggcacattg ccccggcgtg agtcggggca atcccgcaag 3360 

gagggtgaat gaatcggacg tttgaccgga aggcatacag gcaagaactg atcgacgcgg 3420 

ggttttccgc cgaggatgcc gaaaccatcg caagccgcac cgtcatgcgt gcgccccgcg 3480 

aaaccttcca gtccgtcggc tcgatggtcc agcaagctac ggccaagatc gagcgcgaca 3540 

gcgtgcaact ggctccccct gccctgcccg cgccatcggc cgccgtggag cgttcgcgtc 3600 

gtctcgaaca ggaggcggca ggtttggcga agtcgatgac catcgacacg cgaggaacta 3660 

tgacgaccaa gaagcgaaaa accgccggcg aggacctggc aaaacaggtc agcgaggcca 3720 

agcaggccgc gttgctgaaa cacacgaagc agcagatcaa ggaaatgcag ctttccttgt 3780 

tcgatattgc gccgtggccg gacacgatgc gagcgatgcc aaacgacacg gcccgctctg 3840 

ccctgttcac cacgcgcaac aagaaaatcc cgcgcgaggc gctgcaaaac aaggtcattt 3900 

tccacgtcaa caaggacgtg aagatcacct acaccggcgt cgagctgcgg gccgacgatg 3960 

acgaactggt gtggcagcag gtgttggagt acgcgaagcg cacccctatc ggcgagccga 4020 

tcaccttcac gttctacgag ctttgccagg acctgggctg gtcgatcaat ggccggtatt 4080 

acacgaaggc cgaggaatgc ctgtcgcgcc tacaggcgac ggcgatgggc ttcacgtccg 4140 

accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc ctggaccgtg 4200 

gcaagaaaac gtcccgttgc caggtcctga tcgacgagga aatcgtcgtg ctgtttgctg 4260 

gcgaccacta cacgaaattc atatgggaga agtaccgcaa gctgtcgccg acggcccgac 4320 

ggatgttcga ctatttcagc tcgcaccggg agccgtaccc gctcaagctg gaaaccttcc ■ 4380 

gcctcatgtg cggatcggat tccacccgcg tgaagaagtg gcgcgagcag gtcggcgaag 4440 
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cctgcgaaga gttgcgaggc agcggcctgg tggaacacgc ctgggtcaat gatgacctgg 4500 

tgcattgcaa acgctagggc cttgtggggt cagttccggc tgggggttca gcagccagcg 4560 

ctttactggc atttcaggaa caagcgggca ctgctcgacg cacttgcttc gctcagtatc 4620 

gctcgggacg cacggcgcgc tctacgaact gccgataaac agaggattaa aattgacaat 4 680 

tgtgattaag gctcagattc gacggcttgg agcggccgac gtgcaggatt tccgcgagat 4740 

ccgattgtcg gccctgaaga aagctccaga gatgttcggg tccgtttacg agcacgagga 4800 

gaaaaagccc atggaggcgt tcgctgaacg gttgcgagat gccgtggcat tcggcgccta 4860 

catcgacggc gagatcattg ggctgtcggt cttcaaacag gaggacggcc ccaaggacgc 4 920 

tcacaaggcg catctgtccg gcgttttcgt ggagcccgaa cagcgaggcc gaggggtcgc 4 980 

cggtatgctg ctgcgggcgt tgccggcggg tttattgctc gtgatgatcg tccgacagat 5040 

tccaacggga atctggtgga tgcgcatctt catcctcggc gcacttaata tttcgctatt 5100 

ctggagcttg ttgtttattt cggtctaccg cctgccgggc ggggtcgcgg cgacggtagg 5160 

cgctgtgcag ccgctgatgg tcgtgttcat ctctgccgct ctgctaggta gcccgatacg 5220 

attgatggcg gtcctggggg ctatttgcgg aactgcgggc gtggcgctgt tggtgttgac 5280 

accaaacgca gcgctagatc ctgtcggcgt cgcagcgggc ctggcggggg cggtttccat 5340 

ggcgttcgga accgtgctga cccgcaagtg gcaacctccc gtgcctctgc tcacctttac 5400 

cgcctggcaa ctggcggccg gaggacttct gctcgttcca gtagctttag tgtttgatcc 5460 

gccaatcccg atgcctacag gaaccaatgt tctcggcctg gcgtggctcg gcctgatcgg 5520 

agcgggttta acctacttcc tttggttccg ggggatctcg cgactcgaac ctacagttgt 5580 

ttccttactg ggctttctca gccccagatc tggggtcgat cagccgggga tgcatcaggc 5640 

cgacagtcgg aacttcgggt ccccgacctg -taccattcgg tgagcaatgg ataggggagt 5700 
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tgatatcgtc aacgttcact tctaaagaaa tagcgccact cagcttcctc agcggcttta 5760 

tccagcgatt tcctattatg tcggcatagt tctcaagatc gacagcctgt cacggttaag 5820 

cgagaaatga ataagaaggc tgataattcg gatctctgcg agggagatga tatttgatca 5880 

caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga gatcatccgt 5940 

gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac atgagcaaag 6000 

tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg ctgcctgtat 6060 

cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct ggtggcagga 6120 

tatattgtgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg cggacgtttt 6180 

taatgtactg gggtggtttt tcttttcacc agtgagacgg gcaacagctg attgcccttc 6240 

accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtttgccc cagcaggcga 6300 

aaatcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca aaagaatagc 6360 

ccgagatagg gttgagtgtt gttccagttt ggaacaagag tccactatta aagaacgtgg 6420 

actccaacgt caaagggcga aaaaccgtct atcagggcga tggcccacta cgtgaaccat 6480 

cacccaaatc aagttttttg gggtcgaggt gccgtaaagc actaaatcgg aaccctaaag 6540 

ggagcccccg atttagagct tgacggggaa agccggcgaa cgtggcgaga aaggaaggga 6600 

agaaagcgaa aggagcgggc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg 6660 

cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt 6720 

tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattcgag 6780 

ctcggtaccc ggggatcttt cgacactgaa atacgtcgag cctgctccgc ttggaagcgg 6840 

cgaggagcct cgtcctgtca caactaccaa catggagtac gataagggcc agttccgcca 6900 
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gctcattaag agccagttca tgggcgttgg catgatggcc gtcatgcatc tgtacttcaa 6960 

gtacaccaac gctcttctga tccagtcgat catccgctga aggcgctttc gaatctggtt 7020 

aagatccacg tcttcgggaa gccagcgact ggtgacctcc agcgtccctt taaggctgcc 7080 

aacagctttc tcagccaggg ccagcccaag accgacaagg cctccctcca gaacgccgag 7140 

aagaactgga ggggtggtgt caaggaggag taagctcctt attgaagtcg gaggacggag 7200 

cggtgtcaag aggatattct tcgactctgt attatagata agatgatgag gaattggagg 7260 

tagcatagct tcatttggat ttgctttcca ggctgagact ctagcttgga gcatagaggg 7320 

tcctttggct ttcaatattc tcaagtatct cgagtttgaa cttattccct gtgaaccttt 7380 

tattcaccaa tgagcattgg aatgaacatg aatctgagga ctgcaatcgc catgaggttt 7440 

tcgaaataca tccggatgtc gaaggcttgg ggcacctgcg ttggttgaat ttagaacgtg 7500 

gcactattga tcatccgata gctctgcaaa gggcgttgca caatgcaagt caaacgttgc 7560 

tagcagttcc aggtggaatg ttatgatgag cattgtatta aatcaggaga tatagcatga 7 620 

tctctagtta gctcaccaca aaagtcagac ggcgtaacca aaagtcacac aacacaagct 7680 

gtaaggattt cggcacggct acggaagacg gagaagccac cttcagtgga ctcgagtacc 7740 

atttaattct atttgtgttt gatcgagacc taatacagcc cctacaacga ccatcaaagt 7800 

cgtatagcta ccagtgagga agtggactca aatcgacttc agcaacatct cctggataaa 7860 

ctttaagcct aaactataca gaataagata ggtggagagc ttataccgag ctcccaaatc 7920 

tgtccagatc atggttgacc ggtgcctgga tcttcctata gaatcatcct tattcgttga 7980 

cctagctgat tctggagtga cccagagggt catgacttga gcctaaaatc cgccgcctcc 8040 

accatttgta gaaaaatgtg acgaactcgt gagctctgta cagtgaccgg tgactctttc 8100 

tggcatgcgg agagacggac ggacgcagag agaagggctg agtaataagc cactggccag 8160 
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acagctctgg cggctctgag gtgcagtgga tgattattaa tccgggaccg gccgcccctc 8220 

cgccccgaag tggaaaggct ggtgtgcccc tcgttgacca agaatctatt gcatcatcgg 8280 

agaatatgga gcttcatcga atcaccggca gtaagcgaag gagaatgtga agccaggggt 8340 

gtatagccgt cggcgaaata gcatgccatt aacctaggta cagaagtcca attgcttccg 8400 

atctggtaaa agattcacga gatagtacct tctccgaagt aggtagagcg agtacccggc 84 60 

gcgtaagctc cctaattggc ccatccggca tctgtagggc gtccaaatat cgtgcctctc 8520 

ctgctttgcc cggtgtatga aaccggaaag gccgctcagg agctggccag cggcgcagac 8580 

cgggaacaca agctggcagt cgacccatcc ggtgctctgc actcgacctg ctgaggtccc 8640 

tcagtccctg gtaggcagct ttgccccgtc tgtccgcccg gtgtgtcggc ggggttgaca 8700 

aggtcgttgc gtcagtccaa catttgttgc catattttcc tgctctcccc accagctgct 8760 

cttttctttt ctctttcttt tcccatcttc agtatattca tcttcccatc caagaacctt 8820 

tatttcccct aagtaagtac tttgctacat ccatactcca tccttcccat cccttattcc 8880 

tttgaacctt tcagttcgag ctttcccact tcatcgcagc ttgactaaca gctaccccgc 8940 

ttgagcagac atcaccatgc ctgaactcac cgcgacgtct gtcgagaagt ttctgatcga 9000 

aaagttcgac agcgtctccg acctgatgca gctctcggag ggcgaagaat ctcgtgcttt 9060 

cagcttcgat gtaggagggc gtggatatgt cctgcgggta aatagctgcg ccgatggttt 9120 

ctacaaagat cgttatgttt atcggcactt tgcatcggcc gcgctcccga ttccggaagt 9180 

gcttgacatt ggggaattca gcgagagcct gacctattgc atctcccgcc gtgcacaggg 9240 

tgtcacgttg caagacctgc ctgaaaccga actgcccgct gttctgcagc cggtcgcgga 9300 

ggccatggat gcgatcgctg cggccgatct tagccagacg agcgggttcg gcccattcgg 9360 
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accgcaagga atcggtcaat acactacatg gcgtgatttc atatgcgcga ttgctgatcc 9420 

ccatgtgtat cactggcaaa ctgtgatgga cgacaccgtc agtgcgtccg tcgcgcaggc 9480 

tctcgatgag ctgatgcttt gggccgagga ctgccccgaa gtccggcacc tcgtgcacgc 9540 

ggatttcggc tccaacaatg tcctgacgga caatggccgc ataacagcgg tcattgactg 9600 

gagcgaggcg atgttcgggg attcccaata cgagg.tcgcc aacatcttct tctggaggcc 9660 

gtggttggct tgtatggagc agcagacgcg ctacttcgag cggaggcatc cggagcttgc 9720 

aggatcgccg cggctccggg cgtatatgct ccgcattggt cttgaccaac tctatcagag 9780 

cttggttgac ggcaatttcg atgatgcagc ttgggcgcag ggtcgatgcg acgcaatcgt 9840 

ccgatccgga gccgggactg tcgggcgtac acaaatcgcc cgcagaagcg cggccgtctg 9900 

gaccgatggc tgtgtagaag tactcgccga tagtggaaac cgacgcccca gcactcgtcc 9960 

gagggcaaag gaatagagta gatgccgacc gcgggatcga tccacttaac gttactgaaa 10020 

tcatcaaaca gcttgacgaa tctggatata agatcgttgg tgtcgatgtc agctccggag 10080 

ttgagacaaa tggtgttcag gatctcgata agatacgttc atttgtccaa gcagcaaaga 10140 

gtgccttcta gtgatttaat agctccatgt caacaagaat aaaacgcgtt ttcgggttta 10200 

cctcttccag atacagctca tctgcaatgc attaatgcat tgactgcaac ctagtaacgc 10260 

cttncaggct ccggcgaaga gaagaatagc ttagcagagc tattttcatt ttcgggagac 10320 

gagatcaagc agatcaacgg tcgtcaagag acctacgaga ctgaggaatc cgctcttggc 10380 

tccacgcgac tatatatttg tctctaattg tactttgaca tgctcctctt ctttactctg 10440 

atagcttgac tatgaaaatt ccgtcaccag cncctgggtt cgcaaagata attgcatgtt 10500 

tcttccttga actctcaagc ctacaggaca cacattcatc gtaggtataa acctcgaaat 10560 

canttcctac taagatggta tacaatagta accatgcatg gttgcctagt gaatgctccg 10620 
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taacacccaa tacgccggcc gaaacttttt tacaactctc ctatgagtcg tttacccaga 10680 

atgcacaggt acacttgttt agaggtaatc cttctttcta gctagaagtc ctcgtgtact 10740 

gtgtaagcgc ccactccaca tctccactcg acctgcaggc atgcaagctt gagattaaaa 10800 

tagataagga aaagaaagtg aaaagaaatt cggaagcatg gcacattctt ctttttataa 10860 

atacatgcct gactttcttt ttccatcgat atgatatatg catatgatag atatacaagc 10920 

aatcttcttc aaggagtttg aaattttgtc ctccaggagc aaaaaaaagt ttttttttat 10980 

acatgtttgt acacaagaat agttaccaat ttgctttggt cttacgtgct gcaagtttat 11040 

atcgttttca atttctttgt ctttacattt tctttgtcct ttatctttcc tcatttagtc 11100 

tttgggagaa ttaggaaaag ggagcggaaa ggtaagaaat gcttgcgtat tttactaatt 11160 

cggcaaacat ccaatttggc aaacagcagc ctgtgcaacg ctctcgagat gacagtatct 11220 

ttgattacac tctaaatctc gatgacccga ccaaaaagag cgaacaaaga aataatcttg 11280 

tgcattcgaa tatgatggaa gattttttcc cccttattct aaatgttgac atagcgtgta 11340 

tgttatataa acaaaaagaa attgtacaaa ctttcttttc ttctcttttt attttatctc 11400 

tatgatccag ttagaacaac cactcagtca tcaagcaaaa ctgactccag tactgagaag 114 60 

taaatctcag tttaaggggc ttttcattgc tattgtcatt gttagcgcat gggtcattag 11520 

cctgagttta ttactttccc ttgacatctc aaagctaaaa ttttggatgt tattgcctgt 11580 

tatactatgg caaacatttt tatatacggg attatttatt acatctcatg atgccatgca 11640 

tggcgtagta tttccccaaa acaccaagat taatcatttg attggaacat tgaccctatc 11700 

cctttatggt cttttaccat atcaaaaact attgaaaaaa cattggttac accaccacaa 1 11760 

tccagcaagc tcaatagacc cggattttca caatggtaaa caccaaagtt tctttgcttg 11820 
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gtattttcat tttatgaaag gttactggag ttgggggcaa ataattgcgt tgactattat 11880 

ttataacttt gctaaataca tactccatat cccaagtgat aatctaactt acttttgggt 11940 

gctaccctcg cttttaagtt cattacaatt attctatttt ggtacttttt taccccatag 12000 

tgaaccaata gggggttatg ttcagcctca ttgtgcccaa acaattagcc gtcctatttg 12060 

gtggtcattt atcacgtgct atcattttgg ctaccacgag gaacatcacg aatatcctca 12120 

tatttcttgg tggcagttac cagaaattta caaagcaaaa tagaagcttg gcgtaatcat 12180 

ggtcatagct gtttcctgtg tgaaattgtt atccgctcac aattccacac aacatacgag 12240 

ccggaagcat aaagtgtaaa gcctggggtg cctaatgagt gagctaactc acattaattg 12300 

cgttgcgctc actgcccgct ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa 12360 

tcggccaacg cgcggggaga ggcggttcgc gtattgggcc aaagacaaaa gggcgacatt 12420 

caaccgattg agggagggaa ggtaaatatt gacggaaatt attcattaaa ggtgaattat 12480 

caccgtcacc gacttgagcc atttgggaat tagagccagc aaaatcacca gtagcaccat 12540 

taccattagc aaggccggaa acgtcaccaa tgaaaccatc gatagcagca ccgtaatcag 12600 

tagcgacaga atcaagtttg cctttagcgt cagactgtag cgcgttttca tcggcatttt 12660 

cggtcatagc ccccttatta gcgtttgcca tcttttcata atcaaaatca ccggaaccag 12720 

agccaccacc ggaaccgcct ccctcagagc cgccaccctc agaaccgcca ccctcagagc 12780 

caccaccctc agagccgcca ccagaaccac caccagagcc gccgccagca ttgacaggag 12840 

gcccgatcta gtaacataga tgacaccgcg cgcgataatt tatcctagtt tgcgcgctat 12900 

attttgtttt ctatcgcgta ttaaatgtat aattgcggga ctctaatcat aaaaacccat 12960 

ctcataaata acgtcatgca ttacatgtta attattacat gcttaacgta attcaacaga 13020 

aattatatga taatcatcgc aagaccggca acaggattca atcttaagaa actttattgc 13080 
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caaatgtttg aacgatcggg gatcatccgg gtctgtggcg ggaactccac gaaaatatcc 13140 

gaacgcagca agatatcgcg gtgcatctcg gtcttgcctg ggcagtcgcc gccgacgccg 13200 

ttgatgtgga cgccgggccc gatcatattg tcgctcagga tcgtggcgtt gtgcttgtcg 13260 

gccgttgctg tcgtaatgat atcggcacct tcgaccgcct gttccgcaga gatcccgtgg 13320 

gcgaagaact ccagcatgag atccccgcgc tggaggatca tccagccggc gtcccggaaa 13380 

acgattccga agcccaacct ttcatagaag gcggcggtgg aatcgaaatc tcgtgatggc 13440 

aggttgggcg tcgcttggtc ggtcatttcg aaccccagag tcccgctcag aagaactcgt 13500 

caagaaggcg atagaaggcg atgcgctgcg aatcgggagc ggcgataccg taaagcacga 13560 

ggaagcggtc agcccattcg ccgccaagct cttcagcaat atcacgggta gccaacgcta 13620 

tgtcctgata gcggtccgcc acacccagcc ggccacagtc gatgaatcca gaaaagcggc 13680 

cattttccac catgatattc ggcaagcagg catcgccatg ggtcacgacg agatcatcgc 13740 

cgtcgggcat gcgcgccttg agcctggcga acagttcggc tggcgcgagc ccctgatgct 13800 

cttcgtccag atcatcctga tcgacaagac cggcttccat ccgagtacgt gctcgctcga 13860 

tgcgatgttt cgcttggtgg tcgaatgggc aggtagccgg atcaagcgta tgcagccgcc 13920 

gcattgcatc agccatgatg gatactttct cggcaggagc aaggtgagat gacaggagat 13980 

cctgccccgg cacttcgccc aatagcagcc agtcccttcc cgcttcagtg acaacgtcga 14040 

gcacagctgc gcaaggaacg cccgtcgtgg ccagccacga tagccgcgct gcctcgtcct 14100 

gcagttcatt cagggcaccg gacaggtcgg tcttgacaaa aagaaccggg cgcccctgcg 14160 

ctgacagccg gaacacggcg gcatcagagc agccgattgt ctgttgtgcc cagtcatagc 14220 

cgaatagcct ctccacccaa gcggccggag aacctgcgtg caatccatct tgttcaatca 14280 
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tgcgaaacga tccagatccg gtgcagatta tttggattga gagtgaatat gagactctaa 14340 

ttggataccg aggggaattt atggaacgtc agtggagcat ttttgacaag aaatatttgc 14400 

tagctgatag tgaccttagg cgacttttga acgcgcaata atggtttctg acgtatgtgc 14 4 60 

ttagctcatt aaactccaga aacccgcggc tgagtggctc cttcaacgtt gcggttctgt 14520 

cagttccaaa cgtaaaacgg cttgtcccgc gtcatcggcg ggggtcataa cgtgactccc 14580 

ttaattctcc gctcatgatc agattgtcgt ttcccgcctt cagtttaaac tatcagtgtt 14 640 

tgacaggata tattggcggg taaacctaag agaaaagagc gtttattaga ataatcggat 14700 

atttaaaagg gcgtgaaaag gtttatccgt tcgtccattt gtatgtgcat gccaaccaca 14760 

gggttcccca gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg 14820 

atccgacagc gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag 14880 

cagaatgcca tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg 14 940 

cagcaccggc ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac 15000 

gatcaggggt atgttgggti: tcacgtctgg cctccggacc agcctccgct ggtccgattg 15060* 

aacgcgcgga ttctttatca ctgataagtt ggtggacata ttatgtttat cagtgataaa 15120 

gtgtcaagca tgacaaagtt gcagccgaat acagtgatcc gtgccgccct ggacctgttg 15180 

aacgaggtcg gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag 15240 

cagccggcgc tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa 15300 

gccatgctgg cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca 15360 

tttctgatcg ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg 15420 

cgcatccatg ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt 154 80 

cgcttcctct gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc 15540 
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agctacttca ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag 15600 

cgcggcggca ccgttgaaca ggctccgctc tcgccgctgt tgcgggccgc gatagacgcc 15660 

ttcgacgaag ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga 15720 

ttggcgaaaa ggaggctcgt tgtcaggaac gttgaaggac cgagaaaggg tgacgattga 15780 

tcaggaccgc tgccggagcg caacccactc actacagcag agccatgtag acaacatccc 15840 

ctcccccttt ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct 15900 

gccctagcgt ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt 15960 

ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag 16020 

ctcactcaaa ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca 16080 

tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt 16140 

tccataggct ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc 16200 

gaaacccgac aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct 16260 

ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg 16320 

tggcgctttt ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtatat 16380 

ccatcctttt tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg 16440 

tgtatccaac ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtgt 16500 

tccttcttca ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga 16560 

ggctggccgg ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag 16620 

ccaaccagga agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg 16680 

attgaggaaa aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc 16740 
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cagggctaca aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc 16800 

aatggcgacc tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc 16860 

acggcgcggt tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag 16920 

gacgagcttg gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt 16980 

tagccgctaa aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat 17040 

caagaagagc gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga 17100 

gcgcctttgc gacgctca 17118 



<210> 49 

<211> 18449 

<212> DNA 

<213> Artificial 

<220> 

<223> Plasmid 



<220> 

<221> misc_f eature 

<222> (3471) . . (3471) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (3679) . . (3679) 

<223> n is a, c, g, or t 

<220> 

<221> mi sc_f eature 

<222> (3770) . . (3770) 

<223> n is a, c, g, or t 



<400> 49 

gatctttcga cactgaaata cgtcgagcct gctccgcttg gaagcggcga ggagcctcgt 60 
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cctgtcacaa ctaccaacat ggagtacgat aagggccagt tccgccagct cattaagagc 120 

cagttcatgg gcgttggcat gatggccgtc atgcatctgt acttcaagta caccaacgct 180 

cttctgatcc agtcgatcat ccgctgaagg cgctttcgaa tctggttaag atccacgtct 240 

tcgggaagcc agcgactggt gacctccagc gtccctttaa ggctgccaac agctttctca 300 

gccagggcca gcccaagacc gacaaggcct ccctccagaa cgccgagaag aactggaggg 360 

gtggtgtcaa ggaggagtaa gctccttatt gaagtcggag gacggagcgg tgtcaagagg 420 

atattcttcg actctgtatt atagataaga tgatgaggaa ttggaggtag catagcttca 480 

tttggatttg ctttccaggc tgagactcta gcttggagca tagagggtcc tttggctttc 540 

aatattctca agtatctcga gtttgaactt attccctgtg aaccttttat tcaccaatga 600 

gcattggaat gaacatgaat ctgaggactg caatcgccat gaggttttcg aaatacatcc 660 

ggatgtcgaa ggcttggggc acctgcgttg gttgaattta gaacgtggca ctattgatca 720 

tccgatagct ctgcaaaggg cgttgcacaa tgcaagtcaa acgttgctag cagttccagg 780 

tggaatgtta tgatgagcat tgtattaaat caggagatat agcatgatct ctagttagct 840 

caccacaaaa gtcagacggc gtaaccaaaa gtcacacaac acaagctgta aggatttcgg 900 

cacggctacg gaagacggag aagccacctt cagtggactc gagtaccatt taattctatt 960 

tgtgtttgat cgagacctaa tacagcccct acaacgacca tcaaagtcgt atagctacca 1020 

gtgaggaagt ggactcaaat cgacttcagc aacatctcct ggataaactt taagcctaaa 1080 

ctatacagaa taagataggt ggagagctta taccgagctc ccaaatctgt ccagatcatg 1140 
gttgaccggt gcctggatct tcctatagaa tcatccttat tcgttgacct agctgattct • 1200 

ggagtgaccc agagggtcat gacttgagcc taaaatccgc cgcctccacc atttgtagaa 1260 
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aaatgtgacg aactcgtgag ctctgtacag tgaccggtga ctctttctgg catgcggaga 1320 

gacggacgga cgcagagaga agggctgagt aataagccac tggccagaca gctctggcgg 1380 

ctctgaggtg cagtggatga ttattaatcc gggaccggcc gcccctccgc cccgaagtgg 1440 

aaaggctggt gtgcccctcg ttgaccaaga atctattgca tcatcggaga atatggagct 1500 

tcatcgaatc accggcagta agcgaaggag aatgtgaagc caggggtgta tagccgtcgg 1560 

cgaaatagca tgccattaac ctaggtacag aagtccaatt gcttccgatc tggtaaaaga 1620 

ttcacgagat agtaccttct ccgaagtagg tagagcgagt acccggcgcg taagctccct 1680 

aattggccca tccggcatct gtagggcgtc caaatatcgt gcctctcctg ctttgcccgg 1740 

tgtatgaaac cggaaaggcc gctcaggagc tggccagcgg cgcagaccgg gaacacaagc 1800 

tggcagtcga cccatccggt gctctgcact cgacctgctg aggtccctca gtccctggta 1860 

ggcagctttg ccccgtctgt ccgcccggtg tgtcggcggg gttgacaagg tcgttgcgtc 1920 

agtccaacat ttgttgccat attttcctgc tctccccacc agctgctctt ttcttttctc 1980 

tttcttttcc catcttcagt atattcatct tcccatccaa gaacctttat ttcccctaag 2040 

taagtacttt gctacatcca tactccatcc ttcccatccc ttattccttt gaacctttca 2100 

gttcgagctt tcccacttca tcgcagcttg actaacagct accccgcttg agcagacatc 2160 

accatgcctg aactcaccgc gacgtctgtc gagaagtttc tgatcgaaaa gttcgacagc 2220 

gtctccgacc tgatgcagct ctcggagggc gaagaatctc gtgctttcag cttcgatgta 2280 

ggagggcgtg gatatgtcct gcgggtaaat agctgcgccg atggtttcta caaagatcgt 2340 

tatgtttatc ggcactttgc atcggccgcg ctcccgattc cggaagtgct tgacattggg 2400 

gaattcagcg agagcctgac ctattgcatc tcccgccgtg cacagggtgt cacgttgcaa 24 60 

gacctgcctg aaaccgaact gcccgctgtt ctgcagccgg tcgcggaggc catggatgcg 2520 
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atcgctgcgg ccgatcttag ccagacgagc gggttcggcc cattcggacc gcaaggaatc 2580 

ggtcaataca ctacatggcg tgatttcata tgcgcgattg ctgatcccca tgtgtatcac 2640 

tggcaaactg tgatggacga caccgtcagt gcgtccgtcg cgcaggctct cgatgagctg 2700 

atgctttggg ccgaggactg ccccgaagtc cggcacctcg tgcacgcgga tttcggctcc 2760 

aacaatgtcc tgacggacaa tggccgcata acagcggtca ttgactggag cgaggcgatg 2820 

ttcggggatt cccaatacga ggtcgccaac atcttcttct ggaggccgtg gttggcttgt 2880 

atggagcagc agacgcgcta cttcgagcgg aggcatccgg agcttgcagg atcgccgcgg 2940 

ctccgggcgt atatgctccg cattggtctt gaccaactct atcagagctt ggttgacggc 3000 

aatttcgatg atgcagcttg ggcgcagggt cgatgcgacg caatcgtccg atccggagcc 3060 

gggactgtcg ggcgtacaca aatcgcccgc agaagcgcgg ccgtctggac cgatggctgt 3120 

gtagaagtac tcgccgatag tggaaaccga cgccccagca ctcgtccgag ggcaaaggaa 3180 

tagagtagat gccgaccgcg ggatcgatcc acttaacgtt actgaaatca tcaaacagct 3240 

tgacgaatct ggatataaga tcgttggtgt cgatgtcagc tccggagttg agacaaatgg 3300 

tgttcaggat ctcgataaga tacgttcatt tgtccaagca gcaaagagtg ccttctagtg 3360 

atttaatagc tccatgtcaa caagaataaa acgcgttttc gggtttacct cttccagata 3420 

cagctcatct gcaatgcatt aatgcattga ctgcaaccta gtaacgcctt ncaggctccg 3480 

gcgaagagaa gaatagctta gcagagctat tttcattttc gggagacgag atcaagcaga 3540 

tcaacggtcg tcaagagacc tacgagactg aggaatccgc tcttggctcc acgcgactat 3600 

atatttgtct ctaattgtac tttgacatgc tcctcttctt tactctgata gcttgactat 3660 

gaaaattccg tcaccagcnc ctgggttcgc aaagataatt gcatgtttct tccttgaact 3720 
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ctcaagccta caggacacac attcatcgta ggtataaacc tcgaaatcan ttcctactaa 3780 

gatggtatac aatagtaacc atgcatggtt gcctagtgaa tgctccgtaa cacccaatac 3840 

gccggccgaa acttttttac aactctccta tgagtcgttt acccagaatg cacaggtaca 3900 

cttgtttaga ggtaatcctt ctttctagct agaagtcctc gtgtactgtg taagcgccca 3960 

ctccacatct ccactcgacc tgcaggcatg caaagcttga gattaaaata gataaggaaa 4020 

agaaagtgaa aagaaattcg gaagcatggc acattcttct ttttataaat acatgcctga 4080 

ctttcttttt ccatcgatat gatatatgca tatgatagat atacaagcaa tcttcttcaa 4140 

ggagtttgaa attttgtcct ccaggagcaa aaaaaagttt ttttttatac atgtttgtac 4200 

acaagaatag ttaccaattt gctttggtct tacgtgctgc aagtttatat cgttttcaat 4260 

ttctttgtct ttacattttc tttgtccttt atctttcctc atttagtctt tgggagaatt 4320 

aggaaaaggg agcggaaagg taagaaatgc ttgcgtattt tactaattcg gcaaacatcc 4380 

aatttggcaa acagcagcct gtgcaacgct ctcgagatga cagtatcttt gattacactc 4440 

taaatctcga tgacccgacc aaaaagagcg aacaaagaaa taatcttgtg cattcgaata 4500" 

tgatggaaga ttttttcccc cttattctaa atgttgacat agcgtgtatg ttatataaac 4560 

aaaaagaaat tgtacaaact ttcttttctt ctctttttat tttatctcta tgctgtcgaa 4620 

gctgcagtca atcagcgtca aggcccgccg cgttgaacta gcccgcgaca tcacgcggcc 4 680 

caaagtctgc ctgcatgctc agcggtgctc gttagttcgg ctgcgagtgg cagcaccaca 4740 

gacagaggag gcgctgggaa ccgtgcaggc tgccggcgcg ggcgatgagc acagcgccga 4800 

tgtagcactc cagcagcttg accgggctat cgcagagcgt cgtgcccggc gcaaacggga 4860 

gcagctgtca taccaggctg ccgccattgc agcatcaatt ggcgtgtcag gcattgccat 4920 

cttcgccacc tacctgagat ttgccatgca catgaccgtg ggcggcgcag tgccatgggg 4980 
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tgaagtggct ggcactctcc tcttggtggt tggtggcgcg ctcggcatgg agatgtatgc 5040 

ccgctatgca cacaaagcca tctggcatga gtcgcctctg ggctggctgc tgcacaagag 5100 

ccaccacaca cctcgcactg gaccctttga agccaacgac ttgtttgcaa tcatcaatgg 5160 

actgcccgcc atgctcctgt gtacctttgg cttctggctg cccaacgtcc tgggggcggc 5220 

ctgctttgga gcggggctgg gcatcacgct atacggcatg gcatatatgt ttgtacacga 5280 

tggcctggtg cacaggcgct ttcccaccgg gcccatcgct ggcctgccct acatgaagcg 5340 

cctgacagtg gcccaccagc tacaccacag cggcaagtac ggtggcgcgc cctggggtat 5400 

gttcttgggt ccacaggagc tgcagcacat tccaggtgcg gcggaggagg tggagcgact 54 60 

ggtcctggaa ctggactggt ccaagcgggc gattgtgact gatagcgaga ctctgggtcg 5520 

atgttatctg cctcaacaat ggcttagaaa agaagaaaca gaacaaatac agcaaggcaa 5580 

cgcccgtagc ctaggtgatc aaagactgtt gggcttgtct ctgaagcttg taggaaaggc 5640 

agacgctatc atggtgagag ctaagaaggg cattgacaag ttgccggcaa actgtcaagg 5700 

cggtgtacga gctgcttgcc aagtatatgc tgcaattgga tctgtactca agcagcagaa 5760 

gacaacatat cctacaagag ctcatctaaa aggaagcgaa cgtgccaaga ttgctctgtt 5820 

gagtgtatac aacctctatc aatctgaaga caagcctgtg gctctccgtc aagctagaaa 5880 

gattaagagt ttttttgttg attagtgaat ttttgtttta tttatgtctg atagttcaat 5940 

aaagagacaa cacatacaat ataaaatcat tgtctttaaa tgttaattta gtagagtgta 6000 

aagcctgcat tttttttgta cgcataaaca atgaattcac cccgcttctg gtttttaaat 6060 

aattatgtca aactagggaa aattcttttt tttctcttcg ttcttttttt ggcttgttgt 6120 

ggagtcacag gcttgtcttc agattgatag aggttgtata cactcaacag agcaatcttg 6180 
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gcacgttcgc ttccttttag atgagctctt gtaggatatg ttgtcttctg ctgcttgagt 6240 

acagatccaa ttgcagcata tacttggcaa gcagctcgta caccgccttg acagtttgcc 6300 

ggcaacttgt caatgccctt cttagctctc accatgatag cgtctgcctt tcctacaagc 6360 

ttcagagaca agcccaacag tctttgatca cctaggctac gggcgttgcc ttgctgtatt 6420 

tgttctgttt cttcttttct aagccattgt tgaggcagat aacatcgacc caacatcctc 6480 

gagccatact acagcataaa aggatacgtt ttctttaaca gaaatttacc cttttgttat 6540 

cagcacatac aaaaaaaaag aaatttaaga tgagtaggac ttccattctc tcaaaaattt 6600 

tattcaatcc ataaatgaat tatttttgga caaaaaagaa agattatgcc tgattttctc 6660 

tatttttttt ttttttacaa ctccaccaat actttctagc ccagcttggc gtaatcatgg 6720 

tcatagctgt ttcctgtgtg aaattgttat ccgctcacaa ttccacacaa catacgagcc 6780 

ggaagcataa agtgtaaagc ctggggtgcc taatgagtga gctaactcac attaattgcg 6840 

ttgcgctcac tgcccgcttt ccagtcggga aacctgtcgt gccagctgca ttaatgaatc 6900 

ggccaacgcg cggggagagg cggtttgcgt attgggccaa agacaaaagg gcgacattca 6960 

accgattgag ggagggaagg taaatattga cggaaattat tcattaaagg tgaattatca 7020 

ccgtcaccga cttgagccat ttgggaatta gagccagcaa aatcaccagt agcaccatta 7080 

ccattagcaa ggccggaaac gtcaccaatg aaaccatcga tagcagcacc gtaatcagta 7140 

gcgacagaat caagtttgcc tttagcgtca gactgtagcg cgttttcatc ggcattttcg 7200 

gtcatagccc ccttattagc gtttgccatc ttttcataat caaaatcacc ggaaccagag 7260 

ccaccaccgg aaccgcctcc ctcagagccg ccaccctcag aaccgccacc ctcagagcca 7320 

ccaccctcag agccgccacc agaaccacca ccagagccgc cgccagcatt gacaggaggc 7380 

ccgatctagt aacatagatg acaccgcgcg cgataattta tcctagtttg cgcgctatat 7440 
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tttgttttct atcgcgtatt aaatgtataa ttgcgggact ctaatcataa aaacccatct 7500 

cataaataac gtcatgcatt acatgttaat tattacatgc ttaacgtaat tcaacagaaa 7560 

ttatatgata atcatcgcaa gaccggcaac aggattcaat cttaagaaac tttattgcca 7620 

aatgtttgaa cgatcgggga tcatccgggt ctgtggcggg aactccacga aaatatccga 7680 

acgcagcaag atatcgcggt gcatctcggt cttgcctggg cagtcgccgc cgacgccgtt 7740 

gatgtggacg ccgggcccga tcatattgtc gctcaggatc gtggcgttgt gcttgtcggc 7800 

cgttgctgtc gtaatgatat cggcaccttc gaccgcctgt tccgcagaga tcccgtgggc 7860 

gaagaactcc agcatgagat ccccgcgctg gaggatcatc cagccggcgt cccggaaaac 7920 

gattccgaag cccaaccttt catagaaggc ggcggtggaa tcgaaatctc gtgatggcag 7980 

gttgggcgtc gcttggtcgg tcatttcgaa ccccagagtc ccgctcagaa gaactcgtca 8040 

agaaggcgat agaaggcgat gcgctgcgaa tcgggagcgg cgataccgta aagcacgagg 8100 

aagcggtcag cccattcgcc gccaagctct tcagcaatat cacgggtagc caacgctatg 8160 

tcctgatagc ggtccgccac acccagccgg ccacagtcga tgaa'tccaga aaagcggcca 8220 

ttttccacca tgatattcgg caagcaggca tcgccatggg tcacgacgag atcatcgccg 8280 

tcgggcatgc gcgccttgag cctggcgaac agttcggctg gcgcgagccc ctgatgctct 8340 

tcgtccagat catcctgatc gacaagaccg gcttccatcc gagtacgtgc tcgctcgatg 8400 

cgatgtttcg cttggtggtc gaatgggcag gtagccggat caagcgtatg cagccgccgc 84 60 

attgcatcag ccatgatgga tactttctcg gcaggagcaa ggtgagatga caggagatcc 8520 

tgccccggca cttcgcccaa tagcagccag tcccttcccg cttcagtgac aacgtcgagc ■ 8580 

acagctgcgc aaggaacgcc cgtcgtggcc agccacgata gccgcgctgc ctcgtcctgc 8640 
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agttcattca gggcaccgga caggtcggtc ttgacaaaaa gaaccgggcg cccctgcgct 8700 

gacagccgga acacggcggc atcagagcag ccgattgtct gttgtgccca gtcatagccg 8760 

aatagcctct ccacccaagc ggccggagaa cctgcgtgca atccatcttg ttcaatcatg 8820 

cgaaacgatc cagatccggt gcagattatt tggattgaga gtgaatatga gactctaatt 8880 

ggataccgag gggaatttat ggaacgtcag tggagcattt ttgacaagaa atatttgcta 8940 

gctgatagtg accttaggcg acttttgaac gcgcaataat ggtttctgac gtatgtgctt 9000 

agctcattaa actccagaaa cccgcggctg agtggctcct tcaacgttgc ggttctgtca 9060 

gttccaaacg taaaacggct tgtcccgcgt catcggcggg ggtcataacg tgactccctt 9120 

aattctccgc tcatgatcag attgtcgttt cccgccttca gtttaaacta tcagtgtttg 9180 

acaggatata ttggcgggta aacctaagag aaaagagcgt ttattagaat aatcggatat 9240 

ttaaaagggc gtgaaaaggt ttatccgttc gtccatttgt atgtgcatgc caaccacagg 9300 

gttccccaga tctggcgccg gccagcgaga cgagcaagat tggccgccgc ccgaaacgat 9360 

ccgacagcgc gcccagcaca ggtgcgcagg caaattgcac caacgcatac agcgccagca 9420 

gaatgccata gtgggcggtg acgtcgttcg agtgaaccag atcgcgcagg aggcccggca 9480 

gcaccggcat aatcaggccg atgccgacag cgtcgagcgc gacagtgctc agaattacga 9540 

tcaggggtat gttgggtttc acgtctggcc tccggaccag cctccgctgg tccgattgaa 9600 

cgcgcggatt ctttatcact gataagttgg tggacatatt atgtttatca gtgataaagt 9660 

gtcaagcatg acaaagttgc agccgaatac agtgatccgt gccgccctgg acctgttgaa 9720 

cgaggtcggc gtagacggtc tgacgacacg caaactggcg gaacggttgg gggttcagca 9780 

gccggcgctt tactggcact tcaggaacaa gcgggcgctg ctcgacgcac tggccgaagc 984 0 

catgctggcg gagaatcata cgcattcggt gccgagagcc gacgacgact ggcgctcatt 9900 
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tctgatcggg aatgcccgca gcttcaggca ggcgctgctc gcctaccgcg atggcgcgcg 9960 

catccatgcc ggcacgcgac cgggcgcacc gcagatggaa acggccgacg cgcagcttcg 10020 

cttcctctgc gaggcgggtt tttcggccgg ggacgccgtc aatgcgctga tgacaatcag 10080 

ctacttcact gttggggccg tgcttgagga gcaggccggc gacagcgatg ccggcgagcg 10140 

cggcggcacc gttgaacagg ctccgctctc gccgctgttg cgggccgcga tagacgcctt 10200 

cgacgaagcc ggtccggacg cagcgttcga gcagggactc gcggtgattg tcgatggatt 10260 

ggcgaaaagg aggctcgttg tcaggaacgt tgaaggaccg agaaagggtg acgattgatc 10320 

aggaccgctg ccggagcgca acccactcac tacagcagag ccatgtagac aacatcccct 10380 

ccccctttcc accgcgtcag acgcccgtag cagcccgcta cgggcttttt catgccctgc 10440 

cctagcgtcc aagcctcacg gccgcgctcg gcctctctgg cggccttctg gcgctcttcc 10500 

gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct 10560 

cactcaaagg cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg 10620 

tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc 10680 

cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga 10740 

aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct 10800 

cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg 10860 

gcgcttttcc gctgcataac cctgcttcgg ggtcattata gcgatttttt cggtatatcc 10920 

atcctttttc gcacgatata caggattttg ccaaagggtt cgtgtagact ttccttggtg 10980 

tatccaacgg cgtcagccgg gcaggatagg tgaagtaggc ccacccgcga gcgggtgttc 11040 

cttcttcact gtcccttatt cgcacctggc ggtgctcaac gggaatcctg ctctgcgagg 11100 
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ctggccggct accgccggcg taacagatga gggcaagcgg atggctgatg aaaccaagcc 11160 

aaccaggaag ggcagcccac ctatcaaggt gtactgcctt ccagacgaac gaagagcgat 11220 

tgaggaaaag gcggcggcgg ccggcatgag cctgtcggcc tacctgctgg ccgtcggcca 11280 

gggctacaaa atcacgggcg tcgtggacta tgagcacgtc cgcgagctgg cccgcatcaa 11340 

tggcgacctg ggccgcctgg gcggcctgct gaaactctgg ctcaccgacg acccgcgcac 11400 

ggcgcggttc ggtgatgcca cgatcctcgc cctgctggcg aagatcgaag agaagcagga 11460 

cgagcttggc aaggtcatga tgggcgtggt ccgcccgagg gcagagccat gactttttta 11520 

gccgctaaaa cggccggggg gtgcgcgtga ttgccaagca cgtccccatg cgctccatca 11580 

agaagagcga cttcgcggag ctggtgaagt acatcaccga cgagcaaggc aagaccgagc 11640 

gcctttgcga cgctcaccgg gctggttgcc ctcgccgctg ggctggcggc cgtctatggc 11700 

cctgcaaacg cgccagaaac gccgtcgaag ccgtgtgcga gacaccgcgg ccgccggcgt 117 60 

tgtggatacc tcgcggaaaa cttggccctc actgacagat gaggggcgga cgttgacact 11820 

tgaggggccg actcacccgg cgcggcgttg acagatgagg ggcaggctcg atttcggccg 11880 

gcgacgtgga gctggccagc ctcgcaaatc ggcgaaaacg cctgatttta cgcgagtttc 11940 

ccacagatga tgtggacaag cctggggata agtgccctgc ggtattgaca cttgaggggc 12000 

gcgactactg acagatgagg ggcgcgatcc ttgacacttg aggggcagag tgctgacaga 12060 

tgaggggcgc acctattgac atttgagggg ctgtccacag gcagaaaatc cagcatttgc 12120 

aagggtttcc gcccgttttt cggccaccgc taacctgtct tttaacctgc ttttaaacca 12180 

atatttataa accttgtttt taaccagggc tgcgccctgt gcgcgtgacc gcgcacgccg 12240 

aaggggggtg cccccccttc tcgaaccctc ccggcccgct aacgcgggcc tcccatcccc 12300 

ccaggggctg cgcccctcgg ccgcgaacgg cctcacccca aaaatggcag cgctggcagt 12360 
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ccttgccatt gccgggatcg gggcagtaac gggatgggcg atcagcccga gcgcgacgcc 12420 

cggaagcatt gacgtgccgc aggtgctggc atcgacattc agcgaccagg tgccgggcag 12480 

tgagggcggc ggcctgggtg gcggcctgcc cttcacttcg gccgtcgggg cattcacgga 12540 

cttcatggcg gggccggcaa tttttacctt gggcattctt ggcatagtgg tcgcgggtgc 12600 

cgtgctcgtg ttcgggggtg cgataaaccc agcgaaccat ttgaggtgat aggtaagatt 12660 

ataccgaggt atgaaaacga gaattggacc tttacagaat tactctatga agcgccatat 12720 

ttaaaaagct accaagacga agaggatgaa gaggatgagg aggcagattg ccttgaatat 12780 

attgacaata ctgataagat aatatatctt ttatatagaa gatatcgccg tatgtaagga 12840 

tttcaggggg caaggcatag gcagcgcgct tatcaatata tctatagaat gggcaaagca 12900 

taaaaacttg catggactaa tgcttgaaac ccaggacaat aaccttatag cttgtaaatt 12960 

ctatcataat tgggtaatga ctccaactta ttgatagtgt tttatgttca gataatgccc 13020 

gatgactttg tcatgcagct ccaccgattt tgagaacgac agcgacttcc gtcccagccg 13080 

tgccaggtgc tgcctcagat tcaggttatg ccgctcaatt cgctgcgtat atcgcttgct 13140 

gattacgtgc agctttccct tcaggcggga ttcatacagc ggccagccat ccgtcatcca 13200 

tatcaccacg tcaaagggtg acagcaggct cataagacgc cccagcgtcg ccatagtgcg 13260 

ttcaccgaat acgtgcgcaa caaccgtctt ccggagactg tcatacgcgt aaaacagcca 13320 

gcgctggcgc gatttagccc cgacatagcc ccactgttcg tccatttccg cgcagacgat 13380 

gacgtcactg cccggctgta tgcgcgaggt taccgactgc ggcctgagtt ttttaagtga 13440 

cgtaaaatcg tgttgaggcc aacgcccata atgcgggctg ttgcccggca tccaacgcca 13500 

ttcatggcca tatcaatgat tttctggtgc gtaccgggtt gagaagcggt gtaagtgaac 13560 
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tgcagttgcc atgttttacg gcagtgagag cagagatagc gctgatgtcc ggcggtgctt 13620 

ttgccgttac gcaccacccc gtcagtagct gaacaggagg gacagctgat agacacagaa 13680 

gccactggag cacctcaaaa acaccatcat acactaaatc agtaagttgg cagcatcacc 13740 

cataattgtg gtttcaaaat cggctccgtc gatactatgt tatacgccaa ctttgaaaac 13800 

aactttgaaa aagctgtttt ctggtattta aggttttaga atgcaaggaa cagtgaattg 13860 

gagttcgtct tgttataatt agcttcttgg ggtatcttta aatactgtag aaaagaggaa 13920 

ggaaataata aatggctaaa atgagaatat caccggaatt gaaaaaactg atcgaaaaat 13980 

accgctgcgt aaaagatacg gaaggaatgt ctcctgctaa ggtatataag ctggtgggag 14040 

aaaatgaaaa cctatattta aaaatgacgg acagccggta taaagggacc acctatgatg 14100 

tggaacggga aaaggacatg atgctatggc tggaaggaaa gctgcctgtt ccaaaggtcc 14160 

tgcactttga acggcatgat ggctggagca atctgctcat gagtgaggcc gatggcgtcc 14220 

tttgctcgga agagtatgaa gatgaacaaa gccctgaaaa gattatcgag ctgtatgcgg 14280 

agtgcatcag gctctttca'c tccatcgaca tatcggattg tccctatacg aatagcttag 14340 

acagccgctt agccgaattg gattacttac tgaataacga tctggccgat gtggattgcg 14400 

aaaactggga agaagacact ccatttaaag atccgcgcga gctgtatgat tttttaaaga 14460 

cggaaaagcc cgaagaggaa cttgtctttt cccacggcga cctgggagac agcaacatct 14520 

ttgtgaaaga tggcaaagta agtggcttta ttgatcttgg gagaagcggc agggcggaca 14580 

agtggtatga cattgccttc tgcgtccggt cgatcaggga ggatatcggg gaagaacagt 14640 

atgtcgagct attttttgac ttactggggci tcaagcctga ttgggagaaa ataaaatatt 14700 

atat.tttact ggatgaattg ttttagtacc tagatgtggc gcaacgatgc cggcgacaag 14760 

caggagcgca ccgacttctt ccgcatcaag tgttttggct ctcaggccga ggcccacggc 14820 
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aagtatttgg gcaaggggtc gctggtattc gtgcagggca agattcggaa taccaagtac 14880 

gagaaggacg gccagacggt ctacgggacc gacttcattg ccgataaggt ggattatctg 14 940 

gacaccaagg caccaggcgg gtcaaatcag gaataagggc acattgcccc ggcgtgagtc 15000 

ggggcaatcc cgcaaggagg gtgaatgaat cggacgtttg accggaaggc atacaggcaa 15060 

gaactgatcg acgcggggtt ttccgccgag gatgccgaaa ccatcgcaag ccgcaccgtc 15120 

atgcgtgcgc cccgcgaaac cttccagtcc gtcggctcga tggtccagca agctacggcc 15180 

aagatcgagc gcgacagcgt gcaactggct ccccctgccc tgcccgcgcc atcggccgcc 15240 

•gtggagcgtt cgcgtcgtct cgaacaggag gcggcaggtt tggcgaagtc gatgaccatc 15300 

gacacgcgag .gaactatgac gaccaagaag cgaaaaaccg ccggcgagga cctggcaaaa 15360 

caggtcagcg aggccaagca ggccgcgttg ctgaaacaca cgaagcagca gatcaaggaa 15420 

atgcagcttt ccttgttcga tattgcgccg tggccggaca cgatgcgagc gatgccaaac 15480 

gacacggccc gctctgccct gttcaccacg cgcaacaaga aaatcccgcg cgaggcgctg 15540 

caaaacaagg tcattttcca cgtcaacaag gacgtgaaga tcacctacac cggcgtcgag 15600 

ctgcgggccg acgatgacga actggtgtgg cagcaggtgt tggagtacgc gaagcgcacc 15660 

cctatcggcg agccgatcac cttcacgttc tacgagcttt gccaggacct gggctggtcg 15720 

atcaatggcc ggtattacac gaaggccgag gaatgcctgt cgcgcctaca ggcgacggcg 15780 

atgggcttca cgtccgaccg cgttgggcac ctggaatcgg tgtcgctgct gcaccgcttc 15840 

cgcgtcctgg accgtggcaa gaaaacgtcc cgttgccagg tcctgatcga cgaggaaatc 15900 

gtcgtgctgt ttgctggcga ccactacacg aaattcatat gggagaagta ccgcaagctg ■ 15960 

tcgccgacgg cccgacggat gttcgactat ttcagctcgc accgggagcc gtacccgctc 16020 
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aagctggaaa ccttccgcct catgtgcgga tcggattcca cccgcgtgaa gaagtggcgc 16080 

gagcaggtcg gcgaagcctg cgaagagttg cgaggcagcg gcctggtgga acacgcctgg 16140 

gtcaatgatg acctggtgca ttgcaaacgc tagggccttg tggggtcagt tccggctggg 16200 

ggttcagcag ccagcgcttt actggcattt caggaacaag cgggcactgc tcgacgcact 16260 

tgcttcgctc agtatcgctc gggacgcacg gcgcgctcta cgaactgccg ataaacagag 16320 

gattaaaatt gacaattgtg attaaggctc agattcgacg gcttggagcg gccgacgtgc 16380 

aggatttccg cgagatccga ttgtcggccc tgaagaaagc tccagagatg ttcgggtccg 16440 

tttacgagca cgaggagaaa aagcccatgg aggcgttcgc tgaacggttg cgagatgccg 16500 

tggcattcgg cgcctacatc gacggcgaga tcattgggct gtcggtcttc aaacaggagg 16560 

acggccccaa ggacgctcac aaggcgcatc tgtccggcgt tttcgtggag cccgaacagc 16620 

gaggccgagg ggtcgccggt atgctgctgc gggcgttgcc ggcgggttta ttgctcgtga 16680 

tgatcgtccg acagattcca acgggaatct ggtggatgcg catcttcatc ctcggcgcac 16740 

ttaatatttc gctattctg-g agcttgttgt ttatttcggt ctaccgcctg ccgggcgggg 16800 

tcgcggcgac ggtaggcgct gtgcagccgc tgatggtcgt gttcatctct gccgctctgc 16860 

taggtagccc gatacgattg atggcggtcc tgggggctat ttgcggaact gcgggcgtgg 16920 

cgctgttggt gttgacacca aacgcagcgc tagatcctgt cggcgtcgca gcgggcctgg 16980 

cgggggcggt ttccatggcg ttcggaaccg tgctgacccg caagtggcaa cctcccgtgc 17040 

ctctgctcac ctttaccgcc tggcaactgg cggccggagg acttctgctc gttccagtag 17100 

ctttagtgtt tgatccgcca atcccgatgc ctacaggaac caatgttctc ggcctggcgt 17160 

ggctcggcct gatcggagcg ggtttaacct acttcctttg gttccggggg atctcgcgac 17220 

tcgaacctac agttgtttcc ttactgggct ttctcagccc cagatctggg gtcg;atcagc 17280 
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cggggatgca tcaggccgac agtcggaact tcgggtcccc gacctgtacc attcggtgag 17340 

caatggatag gggagttgat atcgtcaacg ttcacttcta aagaaatagc gccactcagc 17400 

ttcctcagcg gctttatcca gcgatttcct attatgtcgg catagttctc aagatcgaca 174 60 

gcctgtcacg gttaagcgag aaatgaataa gaaggctgat aattcggatc tctgcgaggg 17520 

agatgatatt tgatcacagg cagcaacgct ctgtcatcgt tacaatcaac atgctaccct 17580 

ccgcgagatc atccgtgttt caaacccggc agcttagttg ccgttcttcc gaatagcatc 17640 

ggtaacatga gcaaagtctg ccgccttaca acggctctcc cgctgacgcc gtcccggact 17700 

gatgggctgc ctgtatcgag tggtgatttt gtgccgagct gccggtcggg gagctgttgg 17760 

ctggctggtg gcaggatata ttgtggtgta aacaaattga cgcttagaca acttaataac 17820 

acattgcgga cgtttttaat gtactggggt ggtttttctt ttcaccagtg agacgggcaa 17880 

cagctgattg cccttcaccg cctggccctg agagagttgc agcaagcggt ccacgctggt 17940 

ttgccccagc aggcgaaaat cctgtttgat ggtggttccg aaatcggcaa aatcccttat 18000 

aaatcaaaag aatagcccga gatagggttg agtgttgttc cagtttggaa caagagtcca 18060 

ctattaaaga acgtggactc caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc 18120 

ccactacgtg aaccatcacc caaatcaagt tttttggggt cgaggtgccg taaagcacta 18180 

aatcggaacc ctaaagggag cccccgattt agagcttgac ggggaaagcc ggcgaacgtg 18240 

gcgagaaagg aagggaagaa agcgaaagga gcgggcgcca ttcaggctgc gcaactgttg 18300 

ggaagggcga tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc 18360 

tgcaaggcga ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac 18420 

ggccagtgaa ttcgagctcg gtacccggg 18449 
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misc_f eature 
(10264) . . (10264) 
n is a, c, q, or t 

mi sc_f eature 
(10472) . . (10472) 
n is a, c, g, or t 

mis cofeature 
(10563) . . (10563) 
n is a, c, q, or t 



ccgggctggt tgccctcgcc gctgggctgg cggccgtcta tggccctgca aacgcgccag 



60 



aaacgccgtc gaagccgtgt gcgagacacc gcggccgccg gcgttgtgga tacctcgcgg 



120 



aaaacttggc cctcactgac agatgagggg cggacgttga cacttgaggg gccgactcac 



180 



ccggcgcggc gttgacagat gaggggcagg ctcgatttcg gccggcgacg tggagctggc 



240 



cagcctcgca aatcggcgaa aacgcctgat tttacgcgag tttcccacag atgatgtgga 



300 



caagcctggg gataagtgcc ctgcggtatt gacacttgag gggcgcgact actgacagat 



360 



gaggggcgcg atccttgaca cttgaggggc agagtgctga cagatgaggg gcgcacctat 



420 



tgacatttga ggggctgtcc acaggcagaa aatccagcat ttgcaagggt ttccgcccgt 480 
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ttttcggcca ccgctaacct gtcttttaac ctgcttttaa accaatattt ataaaccttg 540 

tttttaacca gggctgcgcc ctgtgcgcgt gaccgcgcac gccgaagggg ggtgcccccc 600 

cttctcgaac cctcccggcc cgctaacgcg ggcctcccat ccccccaggg gctgcgcccc 660 

tcggccgcga acggcctcac cccaaaaatg gcagcgctgg cagtccttgc cattgccggg 720 

atcggggcag taacgggatg ggcgatcagc ccgagcgcga cgcccggaag cattgacgtg 780 

ccgcaggtgc tggcatcgac attcagcgac caggtgccgg gcagtgaggg cggcggcctg 840 

ggtggcggcc tgcccttcac ttcggccgtc ggggcattca cggacttcat ggcggggccg 900 

gcaattttta ccttgggcat tcttggcata gtggtcgcgg gtgccgtgct cgtgttcggg 960 

ggtgcgataa acccagcgaa ccatttgagg tgataggtaa gattataccg aggtatgaaa 1020 

acgagaattg gacctttaca gaattactct atgaagcgcc atatttaaaa agctaccaag 1080 

acgaagagga tgaagaggat gaggaggcag attgccttga atatattgac aatactgata 1140 

agataatata tcttttatat agaagatatc gccgtatgta aggatttcag ggggcaaggc 1200 

ataggcagcg cgcttatcaa tatatctata gaatgggcaa agcataaaaa cttgcatgga 1260 

ctaatgcttg aaacccagga caataacctt atagcttgta aattctatca taattgggta 1320 

atgactccaa cttattgata gtgttttatg ttcagataat gcccgatgac tttgtcatgc 1380 

agctccaccg attttgagaa cgacagcgac ttccgtccca gccgtgccag gtgctgcctc 1440 

agattcaggt tatgccgctc aattcgctgc gtatatcgct tgctgattac gtgcagcttt 1500 

cccttcaggc gggattcata cagcggccag ccatccgtca tccatatcac cacgtcaaag 1560 

ggtgacagca ggctcataag acgccccagc gtcgccatag tgcgttcacc gaatacgtgc 1620 

gcaacaaccg tcttccggag actgtcatac gcgtaaaaca gccagcgctg gcgcgattta 1680 
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gccccgacat agccccactg ttcgtccatt tccgcgcaga cgatgacgtc actgcccggc 1740 

tgtatgcgcg aggttaccga ctgcggcctg agttttttaa gtgacgtaaa atcgtgttga 1800 

ggccaacgcc cataatgcgg gctgttgccc ggcatccaac gccattcatg gccatatcaa 1860 

tgattttctg gtgcgtaccg ggttgagaag cggtgtaagt gaactgcagt tgccatgttt 1920 

tacggcagtg agagcagaga tagcgctgat gtccggcggt gcttttgccg ttacgcacca 1980 

ccccgtcagt agctgaacag gagggacagc tgatagacac agaagccact ggagcacctc 2040 

aaaaacacca tcatacacta aatcagtaag ttggcagcat cacccataat tgtggtttca 2100 

aaatcggctc cgtcgatact atgttatacg ccaactttga aaacaacttt gaaaaagctg 2160 

ttttctggta tttaaggttt tagaatgcaa ggaacagtga attggagttc gtcttgttat 2220 

aattagcttc ttggggtatc tttaaatact gtagaaaaga ggaaggaaat aataaatggc 2280 

taaaatgaga atatcaccgg aattgaaaaa actgatcgaa aaataccgct gcgtaaaaga 2340 

tacggaagga atgtctcctg ctaaggtata taagctggtg ggagaaaatg aaaacctata 2400 

tttaaaaatg acggacagcc ggtataaagg gaccacctat gatgtggaac gggaaaagga 2460 

catgatgcta tggctggaag gaaagctgcc tgttccaaag gtcctgcact ttgaacggca 2520 

tgatggctgg agcaatctgc tcatgagtga ggccgatggc gtcctttgct cggaagagta 2580 

tgaagatgaa caaagccctg aaaagattat cgagctgtat gcggagtgca tcaggctctt 2640 

tcactccatc gacatatcgg attgtcccta tacgaatagc ttagacagcc gcttagccga 2700 

attggattac ttactgaata acgatctggc cgatgtggat tgcgaaaact gggaagaaga 2760 

cactccattt aaagatccgc gcgagctgta tgatttttta aagacggaaa agcccgaaga 2820 

ggaacttgtc ttttcccacg gcgacctggg agacagcaac atctttgtga aagatggcaa 2880 

agtaagtggc tttattgatc ttgggagaag cggcagggcg gacaagtggt atgacattgc 2940 
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cttctgcgtc cggtcgatca gggaggatat cggggaagaa cagtatgtcg agctattttt 3000 

tgacttactg gggatcaagc ctgattggga gaaaataaaa tattatattt tactggatga 3060 

attgttttag tacctagatg tggcgcaacg atgccggcga caagcaggag cgcaccgact 3120 

tcttccgcat caagtgtttt ggctctcagg ccgaggccca cggcaagtat ttgggcaagg 3180 

ggtcgctggt attcgtgcag ggcaagattc ggaataccaa gtacgagaag gacggccaga 3240 

cggtctacgg gaccgacttc attgccgata aggtggatta tctggacacc aaggcaccag 3300 

gcgggtcaaa tcaggaataa gggcacattg ccccggcgtg agtcggggca atcccgcaag 3360 

gagggtgaat gaatcggacg tttgaccgga aggcatacag gcaagaactg atcgacgcgg 3420 

ggttttccgc cgaggatgcc gaaaccatcg caagccgcac cgtcatgcgt gcgccccgcg 3480 

aaaccttcca gtccgtcggc tcgatggtcc agcaagctac ggccaagatc gagcgcgaca 3540 

gcgtgcaact ggctccccct gccctgcccg cgccatcggc cgccgtggag cgttcgcgtc 3600 

gtctcgaaca ggaggcggca ggtttggcga agtcgatgac catcgacacg cgaggaacta 3660 

tgacgaccaa gaagcgaaaa accgccggcg aggacctggc aaaacaggtc agcgaggcca 3720 

agcaggccgc gttgctgaaa cacacgaagc agcagatcaa ggaaatgcag ctttccttgt 3780 

tcgatattgc gccgtggccg gacacgatgc gagcgatgcc aaacgacacg gcccgctctg 3840 

ccctgttcac cacgcgcaac aagaaaatcc cgcgcgaggc gctgcaaaac aaggtcattt 3900 

tccacgtcaa caaggacgtg aagatcacct acaccggcgt cgagctgcgg gccgacgatg 3960 

acgaactggt gtggcagcag gtgttggagt acgcgaagcg cacccctatc ggcgagccga 4020 

tcaccttcac gttctacgag ctttgccagg acctgggctg gtcgatcaat ggccggtatt 4080 

acacgaaggc cgaggaatgc ctgtcgcgcc tacaggcgac ggcgatgggc ttcacgtccg 4140 
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accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc ctggaccgtg 4200 

gcaagaaaac gtcccgttgc caggtcctga tcgacgagga aatcgtcgtg ctgtttgctg 4260 

gcgaccacta cacgaaattc atatgggaga agtaccgcaa gctgtcgccg acggcccgac 4 320 

ggatgttcga ctatttcagc tcgcaccggg agccgtaccc gctcaagctg gaaaccttcc 4 380 

gcctcatgtg cggatcggat tccacccgcg tgaagaagtg gcgcgagcag gtcggcgaag 4440 

cctgcgaaga gttgcgaggc agcggcctgg tggaacacgc ctgggtcaat gatgacctgg 4 500 

tgcattgcaa acgctagggc cttgtggggt cagttccggc tgggggttca gcagccagcg 4 560 

ctttactggc atttcaggaa caagcgggca ctgctcgacg cacttgcttc gctcagtatc 4 620 

gctcgggacg cacggcgcgc tctacgaact gccgataaac agaggattaa aattgacaat 4 680 

tgtgattaag gctcagattc gacggctcgg agcggccgac gtgcaggatt tccgcgagat 4740 

ccgattgtcg gccctgaaga aagctccaga gatgttcggg tccgtttacg agcacgagga 4 800 

gaaaaagccc atggaggcgt tcgctgaacg gttgcgagat gccgtggcat tcggcgccta 4860 

catcgacggc gagatcattg ggctgtcggt cttcaaacag gaggacggcc ccaaggacgc 4 920 

tcacaaggcg catctgtccg gcgttttcgt ggagcccgaa cagcgaggcc gaggggtcgc 4 980 

cggtatgctg ctgcgggcgt tgccggcggg tttattgctc gtgatgatcg tccgacagat 5040 

tccaacggga atctggtgga tgcgcatctt catcctcggc gcacttaata tttcgctatt 5100 

ctggagcttg ttgtttattt cggtctaccg cctgccgggc ggggtcgcgg cgacggtagg 5160 

cgctgtgcag ccgctgatgg tcgtgttcat ctctgccgct ctgctaggta gcccgatacg 5220 

attgatggcg gtcctggggg ctatttgcgg aactgcgggc gtggcgctgt tggtgttgac 5280 

accaaacgca gcgctagatc ctgtcggcgt cgcagcgggc ctggcggggg cggtttccat 5340 

ggcgttcgga accgtgctga cccgcaagtg gcaacctccc gtgcctctgc tcacctttac 5400 
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cgcctggcaa ctggcggccg gaggacttct gctcgttcca gtagctttag tgtttgatcc 54 60 

gccaatcccg atgcctacag gaaccaatgt tctcggcctg gcgtggctcg gcctgatcgg 5520 

agcgggttta acctacttcc tttggttccg ggggatctcg cgactcgaac ctacagttgt 5580 

ttccttactg ggctttctca gccccagatc tggggtcgat cagccgggga tgcatcaggc 5640 

cgacagtcgg aacttcgggt ccccgacctg taccattcgg tgagcaatgg ataggggagt 5700 

tgatatcgtc aacgttcact tctaaagaaa tagcgccact cagcttcctc agcggcttta 5760 

tccagcgatt tcctattatg tcggcatagt tctcaagatc gacagcctgt cacggttaag 5820 

cgagaaatga ataagaaggc tgataattcg gatctctgcg agggagatga tatttgatca 5880 

caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga gatcatccgt 5940 

gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac atgagcaaag 6000 

tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg ctgcctgtat 6060 

cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct ggtggcagga 6120 

tatattgtgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg cggacgtttt 6180 

taatgtactg gggtggtttt tcttttcacc agtgagacgg gcaacagctg attgcccttc 62.40 

accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtttgccc cagcaggcga 6300 

aaatcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca aaagaatagc 6360 

ccgagatagg gttgagtgtt gttccagttt ggaacaagag tccactatta aagaacgtgg 6420 

actccaacgt caaagggcga aaaaccgtct atcagggcga tggcccacta cgtgaaccat 6480 

cacccaaatc aagttttttg gggtcgaggt gccgtaaagc actaaatcgg aaccctaaag 6540 

ggagcccccg atttagagct tgacggggaa agccggcgaa cgtggcgaga aaggaaggga 6600 



BASF AG 303/365 January 08, 2004 
BASF NAE 877/03 

agaaagcgaa aggagcgggc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg • 6660 

cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt 6720 

tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattcgag 6780 

ctcggtaccc ggggatcttt cgacactgaa atacgtcgag cctgctccgc ttggaagcgg 6840 

cgaggagcct cgtcctgtca caactaccaa catggagtac gataagggcc agttccgcca 6900 

gctcattaag agccagttca tgggcgttgg catgatggcc gtcatgcatc tgtacttcaa 6960 

gtacaccaac gctcttctga tccagtcgat catccgctga aggcgctttc gaatctggtt 7020 

aagatccacg tcttcgggaa gccagcgact ggtgacctcc agcgtccctt taaggctgcc 7080 

aacagctttc tcagccaggg ccagcccaag accgacaagg cctccctcca gaacgccgag 7140 

aagaactgga ggggtggtgt caaggaggag taagctcctt attgaagtcg gaggacggag 7200 

cggtgtcaag aggatattct tcgactctgt attatagata agatgatgag gaattggagg 7260 

tagcatagct tcatttggat ttgctttcca ggctgagact ctagcttgga gcatagaggg 7320 

tcctttggct ttcaatattc tcaagtatct cgagtttgaa cttattccct gtgaaccttt 7380 

tattcaccaa tgagcattgg aatgaacatg aatctgagga ctgcaatcgc catgaggttt 7440 

tcgaaataca tccggatgtc gaaggcttgg ggcacctgcg ttggttgaat ttagaacgtg 7500 

gcactattga tcatccgata gctctgcaaa gggcgttgca caatgcaagt caaacgttgc 7560 

tagcagttcc aggtggaatg ttatgatgag cattgtatta aatcaggaga tatagcatga 7620 

tctctagtta gctcaccaca aaagtcagac ggcgtaacca aaagtcacac aacacaagct 7680 

gtaaggattt cggcacggct acggaagacg gagaagccac cttcagtgga ctcgagtacc 7740 

atttaattct atttgtgttt gatcgagacc taatacagcc cctacaacga ccatcaaagt 7800 

cgtatagcta ccagtgagga agtggactca aatcgacttc agcaacatct cctggataaa 7860 
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ctttaagcct aaactataca gaataagata ggtggagagc ttataccgag ctcccaaatc 7920 

tgtccagatc atggttgacc ggtgcctgga tcttcctata gaatcatcct tattcgttga 7980 

cctagctgat tctggagtga cccagagggt catgacttga gcctaaaatc cgccgcctcc 8040 

accatttgta gaaaaatgtg acgaactcgt gagctctgta cagtgaccgg tgactctttc 8100 

tggcatgcgg agagacggac ggacgcagag agaagggctg agtaataagc cactggccag 8160 

acagctctgg cggctctgag gtgcagtgga tgattattaa tccgggaccg gccgcccctc 8220 

cgccccgaag tggaaaggct ggtgtgcccc tcgttgacca agaatctatt gcatcatcgg 8280 

agaatatgga gcttcatcga atcaccggca gtaagcgaag gagaatgtga agccaggggt 8340 

gtatagccgt cggcgaaata gcatgccatt aacctaggta cagaagtcca attgcttccg 8400 

atctggtaaa agattcacga gatagtacct tctccgaagt aggtagagcg agtacccggc 84 60 

gcgtaagctc cctaattggc ccatccggca tctgtagggc gtccaaatat cgtgcctctc 8520 

ctgctttgcc cggtgtatga aaccggaaag gccgctcagg agctggccag cggcgcagac 8580 

cgggaacaca agctggcagt cgacccatcc ggtgctctgc actcgacctg ctgaggtccc 8640 

tcagtccctg gtaggcagct ttgccccgtc tgtccgcccg gtgtgtcggc ggggttgaca 8700 

aggtcgttgc gtcagtccaa catttgttgc catattttcc tgctctcccc accagctgct 8760 

cttttctttt ctctttcttt tcccatcttc agtatattca tcttcccatc caagaacctt 8820 

tatttcccct aagtaagtac tttgctacat ccatactcca tccttcccat cccttattcc 8880 

tttgaacctt tcagttcgag ctttcccact tcatcgcagc ttgactaaca gctaccccgc 8940 

ttgagcagac atcaccatgc ctgaactcac cgcgacgtct gtcgagaagt ttctgatcga 9000 

aaagttcgac agcgtctccg acctgatgca gctctcggag ggcgaagaat ctcgtgcttt 9060 
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cagcttcgat gtaggagggc gtggatatgt cctgcgggta aatagctgcg ccgatggttt 9120 

ctacaaagat cgttatgttt atcggcactt tgcatcggcc gcgctcccga ttccggaagt 9180 

gcttgacatt ggggaattca gcgagagcct gacctattgc atctcccgcc gtgcacaggg 9240 

tgtcacgttg caagacctgc ctgaaaccga actgcccgct gttctgcagc cggtcgcgga 9300 

ggccatggat gcgatcgctg cggccgatct tagccagacg agcgggttcg gcccattcgg 9360 

accgcaagga atcggtcaat acactacatg gcgtgatttc atatgcgcga ttgctgatcc 9420 

ccatgtgtat cactggcaaa ctgtgatgga cgacaccgtc agtgcgtccg tcgcgcaggc 9480 

tctcgatgag ctgatgcttt gggccgagga ctgccccgaa gtccggcacc tcgtgcacgc 9540 

ggatttcggc tccaacaatg tcctgacgga caatggccgc ataacagcgg tcattgactg 9600 

gagcgaggcg atgttcgggg attcccaata cgaggtcgcc aacatcttct tctggaggcc 9660 

gtggttggct tgtatggagc agcagacgcg ctacttcgag cggaggcatc cggagcttgc 9720 

aggatcgccg cggctccggg cgtatatgct ccgcattggt cttgaccaac tctatcagag 9780 

cttggttgac ggcaatttcg atgatgcagc ttgggcgcag ggtcgatgcg acgcaatcgt 9840 

ccgatccgga gccgggactg tcgggcgtac acaaatcgcc cgcagaagcg cggccgtctg 9900 

gaccgatggc tgtgtagaag tactcgccga tagtggaaac cgacgcccca gcactcgtcc 9960 

gagggcaaag gaatagagta gatgccgacc gcgggatcga tccacttaac gttactgaaa 10020 

tcatcaaaca gcttgacgaa tctggatata agatcgttgg tgtcgatgtc agctccggag 10080 

ttgagacaaa tggtgttcag gatctcgata agatacgttc atttgtccaa gcagcaaaga 10140 

gtgccttcta gtgatttaat agctccatgt caacaagaat aaaacgcgtt ttcgggttta 10200 

cctcttccag atacagctca tctgcaatgc attaatgcat tgactgcaac ctagtaacgc 10260 

cttncaggct ccggcgaaga gaagaatagc ttagcagagc tattttcatt ttcgggagac 10320 
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gagatcaagc agatcaacgg tcgtcaagag acctacgaga ctgaggaatc cgctcttggc 10380 

tccacgcgac tatatatttg tctctaattg tactttgaca tgctcctctt ctttactctg 10440 

atagcttgac tatgaaaatt ccgtcaccag cncctgggtt cgcaaagata attgcatgtt 10500 

tcttccttga actctcaagc ctacaggaca cacattcatc gtaggtataa acctcgaaat 10560 

canttcctac taagatggta tacaatagta accatgcatg gttgcctagt gaatgctccg 10620 

taacacccaa tacgccggcc gaaacttttt tacaactctc ctatgagtcg tttacccaga 10680 

atgcacaggt acacttgttt agaggtaatc cttctttcta gctagaagtc ctcgtgtact 10740 

gtgtaagcgc ccactccaca tctccactcg acctgcaggc atgcaagctt gagattaaaa 10800 

tagataagga aaagaaagtg aaaagaaatt cggaagcatg gcacattctt ctttttataa 10860 

atacatgcct gactttcttt ttccatcgat atgatatatg catatgatag atatacaagc 10920 

aatcttcttc aaggagtttg aaattttgtc ctccaggagc aaaaaaaagt ttttttttat 10980 

acatgtttgt acacaagaat agttaccaat ttgctttggt cttacgtgct gcaagtttat 11040 

atcgttttca atttctttgt ctttacattt tctttgtcct ttatctttcc tcatttagtc 11100 

tttgggagaa ttaggaaaag ggagcggaaa ggtaagaaat gcttgcgtat tttactaatt 11160 

cggcaaacat ccaatttggc aaacagcagc ctgtgcaacg ctctcgagat gacagtatct 11220 

ttgattacac tctaaatctc gatgacccga ccaaaaagag cgaacaaaga aataatcttg 11280 

tgcattcgaa tatgatggaa gattttttcc cccttattct aaatgttgac atagcgtgta 11340 

tgttatataa acaaaaagaa attgtacaaa ctttcttttc ttctcttttt attttatctc 11400 

tatgctgtcg aagctgcagt caatcagcgt caaggcccgc cgcgttgaac tagcccgcga • 114 60 

catcacgcgg cccaaagtct gcctgcatgc tcagcggtgc tcgttagttc ggctgcgagt 11520 



BASF AG 307/365 January 08, 2004 

BASF NAE 877/03 

ggcagcacca cagacagagg aggcgctggg aaccgtgcag gctgccggcg cgggcgatga 11580 

gcacagcgcc gatgtagcac tccagcagct tgaccgggct atcgcagagc gtcgtgcccg 11640 

gcgcaaacgg gagcagctgt cataccaggc tgccgccatt gcagcatcaa ttggcgtgtc 11700 

aggcattgcc atcttcgcca cctacctgag atttgccatg cacatgaccg tgggcggcgc 11760 

agtgccatgg ggtgaagtgg ctggcactct cctcttggtg gttggtggcg cgctcggcat 11820 

ggagatgtat gcccgctatg cacacaaagc catctggcat gagtcgcctc tgggctggct 11880 

gctgcacaag agccaccaca. cacctcgcac tggacccttt gaagccaacg acttgtttgc 11940 

aatcatcaat ggactgcccg ccatgctcct gtgtaccttt ggcttctggc tgcccaacgt 12000 

cctgggggcg gcctgctttg gagcggggct gggcatcacg ctatacggca tggcatatat 12060 

gtttgtacac gatggcctgg tgcacaggcg ctttcccacc gggcccatcg ctggcctgcc 12120 

ctacatgaag cgcctgacag tggcccacca gctacaccac agcggcaagt acggtggcgc 12180 

gccctggggt atgttcttgg gtccacagga gctgcagcac attccaggtg cggcggagga 1224 0 

ggtggagcga ctggtcctgg aactggactg gtccaagcgg tagaagcttg agattaaaat 12300 

agataaggaa aagaaagtga aaagaaattc ggaagcatgg cacattcttc tttttataaa 12360 

tacatgcctg actttctttt tccatcgata tgatatatgc atatgataga tatacaagca 12420 

atcttcttca aggagtttga aattttgtcc tccaggagca aaaaaaagtt tttttttata 12480 

catgtttgta cacaagaata gttaccaatt tgctttggtc ttacgtgctg caagtttata 12540 

tcgttttcaa tttctttgtc tttacatttt ctttgtcctt tatctttcct catttagtct 12600 

ttgggagaat taggaaaagg gagcggaaag gtaagaaatg cttgcgtatt ttactaattc 12660 

ggcaaacatc caatttggca aacagcagcc tgtgcaacgc tctcgagatg acagtatctt 12720 

tgattacact ctaaatctcg atgacccgac caaaaagagc gaacaaagaa ataatcttgt 12780 
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gcattcgaat atgatggaag attttttccc ccttattcta aatgttgaca tagcgtgtat 12840 

gttatataaa caaaaagaaa ttgtacaaac tttcttttct tctcttttta ttttatctct 12900 

atgatccagt tagaacaacc actcagtcat caagcaaaac tgactccagt actgagaagt 12960 

aaatctcagt ttaaggggct tttcattgct attgtcattg ttagcgcatg ggtcattagc 13020 

ctgagtttat tactttccct tgacatctca aagctaaaat tttggatgtt attgcctgtt 13080 

atactatggc aaacattttt atatacggga ttatttatta catctcatga tgccatgcat 13140 

ggcgtagtat ttccccaaaa caccaagatt aatcatttga ttggaacatt gaccctatcc 13200 

ctttatggtc ttttaccata tcaaaaacta ttgaaaaaac attggttaca ccaccacaat 13260 

ccagcaagct caatagaccc ggattttcac aatggtaaac accaaagttt ctttgcttgg 13320 

tattttcatt ttatgaaagg ttactggagt tgggggcaaa taattgcgtt gactattatt 13380 

tataactttg ctaaatacat actccatatc ccaagtgata atctaactta cttttgggtg 13440 

ctaccctcgc ttttaagttc attacaatta ttctattttg gtactttttt accccatagt 13500 

gaaccaatag ggggttatgt tcagcctcat tgtgcccaaa caattagccg tcctatttgg 13560 

tggtcattta tcacgtgcta tcattttggc taccacgagg aacatcacga atatcctcat 13620 

atttcttggt ggcagttacc agaaatttac aaagcaaaat agaagcttgg cgtaatcatg 13680 

gtcatagctg tttcctgtgt gaaattgtta tccgctcaca attccacaca acatacgagc 13740 

cggaagcata aagtgtaaag cctggggtgc ctaatgagtg agctaactca cattaattgc 13800 

gttgcgctca ctgcccgctt tccagtcggg aaacctgtcg tgccagctgc attaatgaat 13860 

cggccaacgc gcggggagag gcggtttgcg tattgggcca aagacaaaag ggcgacattc 13920 

aaccgattga gggagggaag gtaaatattg acggaaatta ttcattaaag gtgaattatc 13980 
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accgtcaccg acttgagcca tttgggaatt agagccagca aaatcaccag tagcaccatt 14040 

accattagca aggccggaaa cgtcaccaat gaaaccatcg atagcagcac cgtaatcagt 14100 

agcgacagaa tcaagtttgc ctttagcgtc agactgtagc gcgttttcat cggcattttc 14160 

ggtcatagcc cccttattag cgtttgccat cttttcataa tcaaaatcac cggaaccaga 14220 

gccaccaccg gaaccgcctc cctcagagcc gccaccctca gaaccgccac cctcagagcc 14280 

accaccctca gagccgccac cagaaccacc accagagccg ccgccagcat tgacaggagg 14340 

cccgatctag taacatagat gacaccgcgc gcgataattt atcctagttt gcgcgctata 14 400 

ttttgttttc tatcgcgtat taaatgtata attgcgggac tctaatcata aaaacccatc 14460 

tcataaataa cgtcatgcat tacatgttaa ttattacatg cttaacgtaa ttcaacagaa 14520 

attatatgat aatcatcgca agaccggcaa caggattcaa tcttaagaaa ctttattgcc 14580 

aaatgtttga acgatcgggg atcatccggg tctgtggcgg gaactccacg aaaatatccg 14640 

aacgcagcaa gatatcgcgg tgcatctcgg tcttgcctgg gcagtcgccg ccgacgccgt 14700 

tgatgtggac gccgggcccg atcatattgt cgctcaggat cgtggcgttg tgcttgtcgg 14760 

ccgttgctgt cgtaatgata tcggcacctt cgaccgcctg ttccgcagag atcccgtggg 14820 

cgaagaactc cagcatgaga tccccgcgct ggaggatcat ccagccggcg tcccggaaaa 14880 

cgattccgaa gcccaacctt tcatagaagg cggcggtgga atcgaaatct cgtgatggca 14940 

ggttgggcgt cgcttggtcg gtcatttcga accccagagt cccgctcaga agaactcgtc 15000 

aagaaggcga tagaaggcga tgcgctgcga atcgggagcg gcgataccgt aaagcacgag 15060 

gaagcggtca gcccattcgc cgccaagctc ttcagcaata tcacgggtag ccaacgctat 15120 

gtcctgatag cggtccgcca cacccagccg gccacagtcg atgaatccag aaaagcggcc 15180 

attttccacc atgatattcg gcaagcaggc atcgccatgg gtcacgacga gatcatcgcc 15240 
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gtcgggcatg cgcgccttga gcctggcgaa cagttcggct ggcgcgagcc cctgatgctc 15300 

ttcgtccaga tcatcctgat cgacaagacc ggcttccatc cgagtacgtg ctcgctcgat 15360 

gcgatgtttc gcttggtggt cgaatgggca ggtagccgga tcaagcgtat gcagccgccg 15420 

cattgcatca gccatgatgg atactttctc ggcaggagca aggtgagatg acaggagatc 15480 

ctgccccggc acttcgccca atagcagcca gtcccttccc gcttcagtga caacgtcgag 15540 

cacagctgcg caaggaacgc ccgtcgtggc cagccacgat agccgcgctg cctcgtcctg 15600 

cagttcattc agggcaccgg acaggtcggt cttgacaaaa agaaccgggc gcccctgcgc 15660 

tgacagccgg aacacggcgg catcagagca gccgattgtc tgttgtgccc agtcatagcc 15720 

gaatagcctc tccacccaag cggccggaga acctgcgtgc aatccatctt gttcaatcat 15780 

gcgaaacgat ccagatccgg tgcagattat ttggattgag agtgaatatg agactctaat 15840 

tggataccga ggggaattta tggaacgtca gtggagcatt tttgacaaga aatatttgct 15900 

agctgatagt gaccttaggc gacttttgaa cgcgcaataa tggtttctga cgtatgtgct 15960 

tagctcatta aactccagaa acccgcggct gagtggctcc ttcaacgttg cggttctgtc 16020 

agttccaaac gtaaaacggc ttgtcccgcg tcatcggcgg gggtcataac gtgactccct 16080 

taattctccg ctcatgatca gattgtcgtt tcccgccttc agtttaaact atcagtgttt 16140 

gacaggatat attggcgggt aaacctaaga gaaaagagcg tttattagaa taatcggata 16200 

tttaaaaggg cgtgaaaagg tttatccgtt cgtccatttg tatgtgcatg ccaaccacag 16260 

ggttccccag atctggcgcc ggccagcgag acgagcaaga ttggccgccg cccgaaacga 16320 

tccgacagcg cgcccagcac aggtgcgcag gcaaattgca ccaacgcata cagcgccagc 16380 

agaatgccat agtgggcggt gacgtcgttc gagtgaacca gatcgcgcag gaggcccggc 16440 
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agcaccggca taatcaggcc gatgccgaca gcgtcgagcg cgacagtgct cagaattacg 
atcaggggta tgttgggttt cacgtctggc ctccggacca gcctccgctg gtccgattga 
acgcgcggat tctttatcac tgataagttg gtggacatat tatgtttatc agtgataaag 
tgtcaagcat gacaaagttg cagccgaata cagtgatccg tgccgccctg gacctgttga 
acgaggtcgg cgtagacggt ctgacgacac gcaaactggc ggaacggttg ggggttcagc 
agccggcgct ttactggcac ttcaggaaca agcgggcgct gctcgacgca ctggccgaag 
ccatgctggc ggagaatcat acgcattcgg tgccgagagc cgacgacgac tggcgctcat 
ttctgatcgg gaatgcccgc agcttcaggc aggcgctgct cgcctaccgc gatggcgcgc 
gcatccatgc cggcacgcga ccgggcgcac cgcagatgga aacggccgac gcgcagcttc 
gcttcctctg cgaggcgggt ttttcggccg gggacgccgt caatgcgctg atgacaatca 
gctacttcac tgttggggcc gtgcttgagg agcaggccgg cgacagcgat gccggcgagc 
gcggcggcac cgttgaacag gctccgctct cgccgctgtt gcgggccgcg atagacgcct 
tcgacgaagc cggtccgga'c gcagcgttcg agcagggact cgcggtgatt gtcgatggat 
tggcgaaaag gaggctcgtt gtcaggaacg ttgaaggacc gagaaagggt gacgattgat 
caggaccgct gccggagcgc aacccactca ctacagcaga gccatgtaga caacatcccc 
tccccctttc caccgcgtca gacgcccgta gcagcccgct acgggctttt tcatgccctg 
ccctagcgtc caagcctcac ggccgcgctc ggcctctctg gcggccttct ggcgctcttc 
cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc 
tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat 
gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt 
ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg 
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aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc 17760 

tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt 17820 

ggcgcttttc cgctgcataa ccctgcttcg gggtcattat agcgattttt tcggtatatc 17880 

catccttttt cgcacgatat acaggatttt gccaaagggt tcgtgtagac tttccttggt 17940 

gtatccaacg gcgtcagccg ggcaggatag gtgaagtagg cccacccgcg agcgggtgtt 18000 

ccttcttcac tgtcccttat tcgcacctgg cggtgctcaa cgggaatcct gctctgcgag 18060 

gctggccggc taccgccggc gtaacagatg agggcaagcg gatggctgat gaaaccaagc 18120 

caaccaggaa gggcagccca cctatcaagg tgtactgcct tccagacgaa cgaagagcga 18180 

ttgaggaaaa ggcggcggcg gccggcatga gcctgtcggc ctacctgctg gccgtcggcc 18240 

agggctacaa aatcacgggc gtcgtggact atgagcacgt ccgcgagctg gcccgcatca 18300 

atggcgacct gggccgcctg ggcggcctgc tgaaactctg gctcaccgac gacccgcgca 18360 

cggcgcggtt cggtgatgcc acgatcctcg ccctgctggc gaagatcgaa gagaagcagg 18420 

acgagcttgg caaggtcatg atgggcgtgg tccgcccgag ggcagagcca tgactttttt 18480 

agccgctaaa acggccgggg ggtgcgcgtg attgccaagc acgtccccat gcgctccatc 18540 

aagaagagcg acttcgcgga gctggtgaag tacatcaccg acgagcaagg caagaccgag 18600 

cgcctttgcg acgctca 18617 

<210> 51 
<211> 18333 
<212> DNA 
<213> Artificial 

<220> 

<223> Plasmid 
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<220> 

<221> misc_f eature 

<222> (10264) . . (10264) 

<223> n is a, c, g, or t 



<220> 

<221> misc_f eature 

<222> (10472) . . (10472) 

<223> n is a, c, g, or t 



<220> 

<221> misc_feature 
<222> (10563) . . (10563) 
<223> n is a, c, g, or t 

<400> 51 

ccgggctggt tgccctcgcc gctgggctgg cggccgtcta tggccctgca aacgcgccag 60 

aaacgccgtc gaagccgtgt gcgagacacc gcggccgccg gcgttgtgga tacctcgcgg 120 

aaaacttggc cctcactgac agatgagggg cggacgttga cacttgaggg gccgactcac 180 

ccggcgcggc gttgacagart gaggggcagg ctcgatttcg gccggcgacg tggagctggc 240 

cagcctcgca aatcggcgaa aacgcctgat tttacgcgag tttcccacag atgatgtgga 300 

caagcctggg gataagtgcc ctgcggtatt gacacttgag gggcgcgact actgacagat 360 

gaggggcgcg atccttgaca cttgaggggc agagtgctga cagatgaggg gcgcacctat 420 

tgacatttga ggggctgtcc acaggcagaa aatccagcat ttgcaagggt ttccgcccgt 480 

ttttcggcca ccgctaacct gtcttttaac ctgcttttaa accaatattt ataaaccttg 540 

tttttaacca gggctgcgcc ctgtgcgcgt gaccgcgcac gccgaagggg ggtgcccccc 600 

cttctcgaac cctcccggcc cgctaacgcg ggcctcccat ccccccaggg gctgcgcccc 660 



tcggccgcga acggcctcac cccaaaaatg gcagcgctgg cagtccttgc cattgccggg 720 
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atcggggcag taacgggatg ggcgatcagc ccgagcgcga cgcccggaag cattgacgtg 780 

ccgcaggtgc tggcatcgac attcagcgac caggtgccgg gcagtgaggg cggcggcctg 84 0 

ggtggcggcc tgcccttcac ttcggccgtc ggggcattca cggacttcat ggcggggccg 900 

gcaattttta ccttgggcat tcttggcata gtggtcgcgg gtgccgtgct cgtgttcggg 960 

ggtgcgataa acccagcgaa ccatttgagg tgataggtaa gattataccg aggtatgaaa 1020 

acgagaattg gacctttaca gaattactct atgaagcgcc atatttaaaa agctaccaag 1080 

acgaagagga tgaagaggat gaggaggcag attgccttga atatattgac aatactgata 1140 

agataatata tcttttatat agaagatatc gccgtatgta aggatttcag ggggcaaggc 1200 

ataggcagcg cgcttatcaa tatatctata gaatgggcaa agcataaaaa cttgcatgga 1260 

ctaatgcttg aaacccagga ca'ataacctt atagcttgta aattctatca taattgggta 1320 

atgactccaa cttattgata gtgttttatg ttcagataat gcccgatgac tttgtcatgc 1380 

agctccaccg attttgagaa cgacagcgac ttccgtccca gccgtgccag gtgctgcctc 1440 

agattcaggt tatgccgctc aattcgctgc gtatatcgct tgctgattac gtgcagcttt 1500 

cccttcaggc gggattcata cagcggccag ccatccgtca tccatatcac cacgtcaaag 1560 

ggtgacagca ggctcataag acgccccagc gtcgccatag tgcgttcacc gaatacgtgc 1620 

gcaacaaccg tcttccggag actgtcatac gcgtaaaaca gccagcgctg gcgcgattta 1680 

gccccgacat agccccactg ttcgtccatt tccgcgcaga cgatgacgtc actgcccggc 174 0 

tgtatgcgcg aggttaccga ctgcggcctg agttttttaa gtgacgtaaa atcgtgttga 1800 

ggccaacgcc cataatgcgg gctgttgccc ggcatccaac gccattcatg gccatatcaa 1860 

tgattttctg gtgcgtaccg ggttgagaag cggtgtaagt gaactgcagt tgccatgttt 1920 
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tacggcagtg agagcagaga tagcgctgat gtccggcggt gcttttgccg ttacgcacca 1980 

ccccgtcagt agctgaacag gagggacagc tgatagacac agaagccact ggagcacctc 2040 

aaaaacacca tcatacacta aatcagtaag ttggcagcat cacccataat tgtggtttca 2100 

aaatcggctc cgtcgatact atgttatacg ccaactttga aaacaacttt gaaaaagctg 2160 

ttttctggta tttaaggttt tagaatgcaa ggaacagtga attggagttc gtcttgttat 2220 

aattagcttc ttggggtatc tttaaatact gtagaaaaga ggaaggaaat aataaatggc 2280 

taaaatgaga atatcaccgg aattgaaaaa actgatcgaa aaataccgct gcgtaaaaga 2340 

tacggaagga atgtctcctg ctaaggtata taagctggtg ggagaaaatg aaaacctata 2400 

tttaaaaatg acggacagcc ggtataaagg gaccacctat gatgtggaac gggaaaagga 24 60 

catgatgcta tggctggaag gaaagctgcc tgttccaaag gtcctgcact ttgaacggca 2520 

tgatggctgg . agcaatctgc tcatgagtga ggccgatggc gtcctttgct cggaagagta 2580 

tgaagatgaa caaagccctg aaaagattat cgagctgtat gcggagtgca tcaggctctt 2640 

tcactccatc gacatatcgg attgtcccta tacgaatagc ttagacagcc gcttagccga 2700 

attggattac ttactgaata acgatctggc cgatgtggat tgcgaaaact gggaagaaga 27 60 

cactccattt aaagatccgc gcgagctgta tgatttttta aagacggaaa agcccgaaga 2820 

ggaacttgtc ttttcccacg gcgacctggg agacagcaac atctttgtga aagatggcaa 2880 

agtaagtggc tttattgatc ttgggagaag cggcagggcg gacaagtggt atgacattgc 2940 

cttctgcgtc cggtcgatca gggaggatat cggggaagaa cagtatgtcg agctattttt 3000 

tgacttactg gggatcaagc ctgattggga gaaaataaaa tattatattt tactggatga 3060 

attgttttag tacctagatg tggcgcaacg atgccggcga caagcaggag cgcaccgact 3120 

tcttccgcat caagtgtttt ggctctcagg ccgaggccca cggcaagtat ttgggcaagg 3180 
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ggtcgctggt attcgtgcag ggcaagattc ggaataccaa gtacgagaag gacggccaga 3240 

cggtctacgg gaccgacttc attgccgata aggtggatta tctggacacc aaggcaccag 3300 

gcgggtcaaa tcaggaataa gggcacattg ccccggcgtg agtcggggca atcccgcaag 3360 

gagggtgaat gaatcggacg tttgaccgga aggcatacag gcaagaactg atcgacgcgg 3420 

ggttttccgc cgaggatgcc gaaaccatcg caagccgcac cgtcatgcgt gcgccccgcg 3480 

aaaccttcca gtccgtcggc tcgatggtcc agcaagctac ggccaagatc gagcgcgaca 3540 

gcgtgcaact ggctccccct gccctgcccg cgccatcggc cgccgtggag cgttcgcgtc 3600 

gtctcgaaca ggaggcggca ggtttggcga agtcgatgac catcgacacg cgaggaacta 3660 

tgacgaccaa gaagcgaaaa accgccggcg aggacctggc aaaacaggtc agcgaggcca 3720 

agcaggccgc gttgctgaaa cacacgaagc agcagatcaa ggaaatgcag ctttccttgt 3780 

tcgatattgc gccgtggccg gacacgatgc gagcgatgcc aaacgacacg gcccgctctg 3840 

ccctgttcac cacgcgcaac aagaaaatcc cgcgcgaggc gctgcaaaac aaggtcattt 3900 

tccacgtcaa caaggacgtg aagatcacct acaccggcgt cgagctgcgg gccgacgatg 3960 

acgaactggt gtggcagcag gtgttggagt acgcgaagcg cacccctatc ggcgagccga 4 020 

tcaccttcac gttctacgag ctttgccagg acctgggctg gtcgatcaat ggccggtatt 4080 

acacgaaggc cgaggaatgc ctgtcgcgcc tacaggcgac ggcgatgggc ttcacgtccg 4140 

accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc ctggaccgtg 4200 

gcaagaaaac gtcccgttgc caggtcctga tcgacgagga aatcgtcgtg ctgtttgctg 4260 

gcgaccacta cacgaaattc atatgggaga agtaccgcaa gctgtcgccg acggcccgac 4320 

ggatgttcga ctatttcagc tcgcaccggg agccgtaccc gctcaagctg gaaaccttcc 4380 
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gcctcatgtg cggatcggat tccacccgcg tgaagaagtg gcgcgagcag gtcggcgaag 4440 

cctgcgaaga gttgcgaggc agcggcctgg tggaacacgc ctgggtcaat gatgacctgg 4500 

tgcattgcaa acgctagggc cttgtggggt cagttccggc tgggggttca gcagccagcg 4560 

ctttactggc atttcaggaa caagcgggca ctgctcgacg cacttgcttc gctcagtatc 4 620 

gctcgggacg cacggcgcgc tctacgaact gccgataaac agaggattaa aattgacaat 4 680 

tgtgattaag gctcagattc gacggcttgg agcggccgac gtgcaggatt tccgcgagat 4740 

ccgattgtcg gccctgaaga aagctccaga gatgttcggg tccgtttacg agcacgagga 4800 

gaaaaagccc atggaggcgt tcgctgaacg gttgcgagat gccgtggcat tcggcgccta 4860 

catcgacggc gagatcattg ggctgtcggt cttcaaacag gaggacggcc ccaaggacgc 4 920 

tcacaaggcg catctgtccg gcgttttcgt ggagcccgaa cagcgaggcc gaggggtcgc 4 980 

cggtatgctg ctgcgggcgt tgccggcggg tttattgctc gtgatgatcg tccgacagat 5040 

tccaacggga atctggtgga tgcgcatctt catcctcggc gcacttaata tttcgctatt 5100 

ctggagcttg ttgtttattt cggtctaccg cctgccgggc ggggtcgcgg cgacggtagg 5160 

cgctgtgcag ccgctgatgg tcgtgttcat ctctgccgct ctgctaggta gcccgatacg 5220 

attgatggcg gtcctggggg ctatttgcgg aactgcgggc gtggcgctgt tggtgttgac 5280 

accaaacgca gcgctagatc ctgtcggcgt cgcagcgggc ctggcggggg cggtttccat 5340 

ggcgttcgga accgtgctga cccgcaagtg gcaacctccc gtgcctctgc tcacctttac 5400 

cgcctggcaa ctggcggccg gaggacttct gctcgttcca gtagctttag tgtttgatcc 54 60 

gccaatcccg atgcctacag gaaccaatgt tctcggcctg gcgtggctcg gcctgatcgg 5520 

agcgggttta acctacttcc tttggttccg ggggatctcg cgactcgaac ctacagttgt 5580 

ttccttactg ggctttctca gccccagatc tggggtcgat cagccgggga tgcatcaggc 5640 
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cgacagtcgg aacttcgggt ccccgacctg taccattcgg tgagcaatgg ataggggagt 5700 

tgatatcgtc aacgttcact tctaaagaaa tagcgccact cagcttcctc agcggcttta 5760 

tccagcgatt tcctattatg tcggcatagt tctcaagatc gacagcctgt cacggttaag 5820 

cgagaaatga ataagaaggc tgataattcg gatctctgcg agggagatga tatttgatca 5880 

caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga gatcatccgt 5940 

gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac atgagcaaag 6000 

tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg ctgcctgtat 6060 

cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct ggtggcagga 6120 

tatattgtgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg cggacgtttt 6180 

taatgtactg gggtggtttt tcttttcacc agtgagacgg gcaacagctg attgcccttc 6240 

accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtttgccc cagcaggcga 6300 

aaatcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca aaagaatagc 6360 

ccgagatagg gttgagtgtt gttccagttt ggaacaagag tccactatta aagaacgtgg 6420 

actccaacgt caaagggcga aaaaccgtct atcagggcga tggcccacta cgtgaaccat 64 80 

cacccaaatc aagttttttg gggtcgaggt gccgtaaagc actaaatcgg aaccctaaag 6540 

ggagcccccg atttagagct tgacggggaa agccggcgaa cgtggcgaga aaggaaggga 6600 

agaaagcgaa aggagcgggc gccattcagg ctgcgcaact gttgggaagg gcgatcggtg 6660 

cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag gcgattaagt 6720 

tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag tgaattcgag " 6780 

ctcggtaccc ggggatcttt cgacactgaa atacgtcgag cctgctccgc ttggaagcgg 6840 
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cgaggagcct cgtcctgtca caactaccaa catggagtac gataagggcc agttccgcca 6900 

gctcattaag agccagttca tgggcgttgg catgatggcc gtcatgcatc tgtacttcaa 6960 

gtacaccaac gctcttctga tccagtcgat catccgctga aggcgctttc gaatctggtt 7020 

aagatccacg tcttcgggaa gccagcgact ggtgacctcc agcgtccctt taaggctgcc 7080 

aacagctttc tcagccaggg ccagcccaag accgacaagg cctccctcca gaacgccgag 7140 

aagaactgga ggggtggtgt caaggaggag taagctcctt attgaagtcg gaggacggag 7200 

cggtgtcaag aggatattct tcgactctgt attatagata agatgatgag gaattggagg 7260 

tagcatagct tcatttggat ttgctttcca ggctgagact ctagcttgga gcatagaggg 7320 

tcctttggct ttcaatattc tcaagtatct cgagtttgaa cttattccct gtgaaccttt 7380 

tattcaccaa tgagcattgg aatgaacatg aatctgagga ctgcaatcgc catgaggttt 7440 

tcgaaataca tccggatgtc gaaggcttgg ggcacctgcg ttggttgaat ttagaacgtg 7500 

gcactattga tcatccgata gctctgcaaa gggcgttgca caatgcaagt caaacgttgc 7560 

tagcagttcc aggtggaatg ttatgatgag cattgtatta aatcaggaga tatagcatga 7620 

tctctagtta gctcaccaca aaagtcagac ggcgtaacca aaagtcacac aacacaagct 7 680 

gtaaggattt cggcacggct acggaagacg gagaagccac cttcagtgga ctcgagtacc 7740 

atttaattct atttgtgttt gatcgagacc taatacagcc cctacaacga ccatcaaagt 7800 

cgtatagcta ccagtgagga agtggactca aatcgacttc agcaacatct cctggataaa 7860 

ctttaagcct aaactataca gaataagata ggtggagagc ttataccgag ctcccaaatc 7920 

tgtccagatc atggttgacc ggtgcctgga tcttcctata gaatcatcct tattcgttga 7980 

cctagctgat tctggagtga cccagagggt catgacttga gcctaaaatc cgccgcctcc 8040 

accatttgta gaaaaatgtg acgaactcgt gagctctgta cagtgaccgg tgactctttc 8100 
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tggcatgcgg agagacggac ggacgcagag agaagggctg agtaataagc cactggccag 8160 

acagctctgg cggctctgag gtgcagtgga tgattattaa tccgggaccg gccgcccctc 8220 

cgccccgaag tggaaaggct ggtgtgcccc tcgttgacca agaatctatt gcatcatcgg 8280 

agaatatgga gcttcatcga atcaccggca gtaagcgaag gagaatgtga agccaggggt 8340 

gtatagccgt cggcgaaata gcatgccatt aacctaggta cagaagtcca attgcttccg 8400 

atctggtaaa agattcacga gatagtacct tctccgaagt aggtagagcg agtacccggc 84 60 

gcgtaagctc cctaattggc ccatccggca tctgtagggc gtccaaatat cgtgcctctc 8520 

ctgctttgcc cggtgtatga aaccggaaag gccgctcagg agctggccag cggcgcagac 8580 

cgggaacaca agctggcagt cgacccatcc ggtgctctgc actcgacctg ctgaggtccc 8640 

tcagtccctg gtaggcagct ttgccccgtc tgtccgcccg gtgtgtcggc ggggttgaca 8700 

aggtcgttgc gtcagtccaa catttgttgc catattttcc tgctctcccc accagctgct 8760 

cttttctttt ctctttcttt tcccatcttc agtatattca tcttcccatc caagaacctt 8820 

tatttcccct aagtaagtac tttgctacat ccatactcca tccttcccat cccttattcc 8880 

tttgaacctt tcagttcgag ctttcccact tcatcgcagc ttgactaaca gctaccccgc 8940 

ttgagcagac atcaccatgc ctgaactcac cgcgacgtct gtcgagaagt ttctgatcga 9000 

aaagttcgac agcgtctccg acctgatgca gctctcggag ggcgaagaat ctcgtgcttt 9060 

cagcttcgat gtaggagggc gtggatatgt cctgcgggta aatagctgcg ccgatggttt 9120 

ctacaaagat cgttatgttt atcggcactt tgcatcggcc gcgctcccga ttccggaagt 9180 

gcttgacatt ggggaattca gcgagagcct gacctattgc atctcccgcc gtgcacaggg 9240 

tgtcacgttg caagacctgc ctgaaaccga actgcccgct gttctgcagc cggtcgcgga 9300 
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ggccatggat gcgatcgctg cggccgatct tagccagacg agcgggttcg gcccattcgg 9360 

accgcaagga atcggtcaat acactacatg gcgtgatttc atatgcgcga ttgctgatcc 9420 

ccatgtgtat cactggcaaa ctgtgatgga cgacaccgtc agtgcgtccg tcgcgcaggc 9480 

tctcgatgag ctgatgcttt gggccgagga ctgccccgaa gtccggcacc tcgtgcacgc 9540 

ggatttcggc tccaacaatg tcctgacgga caatggccgc ataacagcgg tcattgactg 9600 

gagcgaggcg atgttcgggg attcccaata cgaggtcgcc aacatcttct tctggaggcc 9660 

gtggttggct tgtatggagc agcagacgcg ctacttcgag cggaggcatc cggagcttgc 9720 

aggatcgccg cggctccggg cgtatatgct ccgcattggt cttgaccaac tctatcagag 9780 

cttggttgac ggcaatttcg atgatgcagc ttgggcgcag ggtcgatgcg acgcaatcgt 984 0 

ccgatccgga gccgggactg tcgggcgtac acaaatcgcc cgcagaagcg cggccgtctg 9900 

gaccgatggc tgtgtagaag tactcgccga tagtggaaac cgacgcccca gcactcgtcc 9960 

gagggcaaag gaatagagta gatgccgacc gcgggatcga tccacttaac gttactgaaa 10020 

tcatcaaaca gcttgacgaa tctggatata agatcgttgg tgtcgatgtc agctccggag 10080* 

ttgagacaaa tggtgttcag gatctcgata agatacgttc atttgtccaa gcagcaaaga 10140 

gtgccttcta gtgatttaat agctccatgt caacaagaat aaaacgcgtt ttcgggttta 10200 

cctcttccag atacagctca tctgcaatgc attaatgcat tgactgcaac ctagtaacgc 10260 

cttncaggct ccggcgaaga gaagaatagc ttagcagagc tattttcatt ttcgggagac 10320 

gagatcaagc agatcaacgg tcgtcaagag acctacgaga ctgaggaatc cgctcttggc 10380 

tccacgcgac tatatatttg tctctaattg tactttgaca tgctcctctt ctttactctg 10440 

atagcttgac tatgaaaatt ccgtcaccag cncctgggtt cgcaaagata attgcatgtt 10500 

tcttccttga actctcaagc ctacaggaca cacattcatc gtaggtataa acctcgaaat 10560 
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canttcctac taagatggta tacaatagta accatgcatg gttgcctagt gaatgctccg 10620 

taacacccaa tacgccggcc gaaacttttt tacaactctc ctatgagtcg tttacccaga 10680 

atgcacaggt acacttgttt agaggtaatc cttctttcta gctagaagtc ctcgtgtact 10740 

gtgtaagcgc ccactccaca tctccactcg acctgcaggc atgcaagctt gagattaaaa 10800 

tagataagga aaagaaagtg aaaagaaatt cggaagcatg gcacattctt ctttttataa 10860 

atacatgcct gactttcttt ttccatcgat atgatatatg catatgatag atatacaagc 10920 

aatcttcttc aaggagtttg aaattttgtc ctccaggagc aaaaaaaagt ttttttttat 10980 

acatgtttgt acacaagaat agttaccaat ttgctttggt cttacgtgct gcaagtttat 11040 

atcgttttca atttctttgt ctttacattt tctttgtcct ttatctttcc tcatttagtc 11100 

tttgggagaa ttaggaaaag ggagcggaaa ggtaagaaat gcttgcgtat tttactaatt 11160 

cggcaaacat ccaatttggc aaacagcagc ctgtgcaacg ctctcgagat gacagtatct 11220 

ttgattacac tctaaatctc gatgacccga ccaaaaagag cgaacaaaga aataatcttg 11280 

tgcattcgaa tatgatggaa gattttttcc cccttattct aaatgttgac atagcgtgta 11340 

tgttatataa acaaaaagaa attgtacaaa ctttcttttc ttctcttttt attttatctc 11400 

tatgttgtgg atttggaatg ccctgatcgt tttcgttacc gtgattggca tggaagtgat 11460 

tgctgcactg gcacacaaat acatcatgca cggctggggt tggggatggc atctttcaca 11520 

tcatgaaccg cgtaaaggtg cgtttgaagt taacgatctt tatgccgtgg tttttgctgc 11580 

attatcgatc ctgctgattt atctgggcag tacaggaatg tggccgctcc agtggattgg 11640 

cgcaggtatg acggcgtatg gattactcta ttttatggtg cacgacgggc tggtgcatca 11700 

acgttggcca ttccgctata ttccacgcaa gggctacctc aaacggttgt atatggcgca 11760 
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ccgtatgcat cacgccgtca ggggcaaaga aggttgtgtt tcttttggct tcctctatgc 11820 

gccgcccctg tcaaaacttc aggcgacgct ccgggaaaga catggcgcta gagcgggcgc 11880 

tgccagagat gcgcagggcg gggaggatga gcccgcatcc gggaagtaag ggcctgacca 11940 

gaggcggcca gcagcagcgt taatttttcg ggcgtggtcg ttgactgccg ctgatcccaa 12000 

agcttgagat taaaatagat aaggaaaaga aagtgaaaag aaattcggaa gcatggcaca 12060 

ttcttctttt tataaataca tgcctgactt tctttttcca tcgatatgat atatgcatat 12120 

gatagatata caagcaatct tcttcaagga gtttgaaatt ttgtcctcca ggagcaaaaa 12180 

aaagtttttt tttatacatg tttgtacaca agaatagtta ccaatttgct ttggtcttac 12240 

gtgctgcaag tttatatcgt tttcaatttc tttgtcttta cattttcttt gtcctttatc 12300 

tttcctcatt tagtctttgg gagaattagg aaaagggagc ggaaaggtaa gaaatgcttg 12360 

cgtattttac taattcggca aacatccaat ttggcaaaca gcagcctgtg caacgctctc 12420 

gagatgacag tatctttgat tacactctaa atctcgatga cccgaccaaa aagagcgaac 12480 

aaagaaataa tcttgtgcat: tcgaatatga tggaagattt tttccccctt attctaaatg 12540 

ttgacatagc gtgtatgtta tataaacaaa aagaaattgt acaaactttc ttttcttctc 12600 

tttttatttt atctctatga tccagttaga acaaccactc agtcatcaag caaaactgac 12660 

tccagtactg agaagtaaat ctcagtttaa ggggcttttc attgctattg tcattgttag 12720 

cgcatgggtc attagcctga gtttattact ttcccttgac atctcaaagc taaaattttg 12780 

gatgttattg cctgttatac tatggcaaac atttttatat acgggattat ttattacatc 12840 

tcatgatgcc atgcatggcg tagtatttcc ccaaaacacc aagattaatc atttgattgg 12900 

aacattgacc ctatcccttt atggtctttt accatatcaa aaactattga aaaaacattg 12960 

gttacaccac cacaatccag caagctcaat agacccggat tttcacaatg gtaaacacca 13020 
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aagtttcttt gcttggtatt ttcattttat gaaaggttac tggagttggg ggcaaataat 13080 

tgcgttgact attatttata actttgctaa atacatactc catatcccaa gtgataatct 13140 

aacttacttt tgggtgctac cctcgctttt aagttcatta caattattct attttggtac 13200 

ttttttaccc catagtgaac caataggggg ttatgttcag cctcattgtg cccaaacaat 13260 

tagccgtcct atttggtggt catttatcac gtgctatcat tttggctacc acgaggaaca 13320 

tcacgaatat cctcatattt cttggtggca gttaccagaa atttacaaag caaaatagaa 13380 

gcttggcgta atcatggtca tagctgtttc ctgtgtgaaa ttgttatccg ctcacaattc 13440 

cacacaacat acgagccgga agcataaagt gtaaagcctg gggtgcctaa tgagtgagct 13500 

aactcacatt aattgcgttg cgctcactgc ccgctttcca gtcgggaaac ctgtcgtgcc 13560 

agctgcatta atgaatcggc caacgcgcgg ggagaggcgg tttgcgtatt gggccaaaga 13620 

caaaagggcg acattcaacc gattgaggga gggaaggtaa atattgacgg aaattattca 13680 

ttaaaggtga attatcaccg tcaccgactt gagccatttg ggaattagag ccagcaaaat 13740 

caccagtagc accattacca ttagcaaggc cggaaacgtc accaatgaaa ccatcgatag 13800 

cagcaccgta atcagtagcg acagaatcaa gtttgccttt agcgtcagac tgtagcgcgt 13860 

tttcatcggc attttcggtc atagccccct tattagcgtt tgccatcttt tcataatcaa 13920 

aatcaccgga accagagcca ccaccggaac cgcctccctc agagccgcca ccctcagaac 13980 

cgccaccctc agagccacca ccctcagagc cgccaccaga accaccacca gagccgccgc 14040 

cagcattgac aggaggcccg atctagtaac atagatgaca ccgcgcgcga taatttatcc 14100 

tagtttgcgc gctatatttt gttttctatc gcgtattaaa tgtataattg cgggactcta" 14160 

atcataaaaa cccatctcat aaataacgtc atgcattaca tgttaattat tacatgctta 14220 
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acgtaattca acagaaatta tatgataatc atcgcaagac cggcaacagg attcaatctt 14280 

aagaaacttt attgccaaat gtttgaacga tcggggatca tccgggtctg tggcgggaac 14340 

tccacgaaaa tatccgaacg cagcaagata tcgcggtgca tctcggtctt gcctgggcag 14400 

tcgccgccga cgccgttgat gtggacgccg ggcccgatca tattgtcgct caggatcgtg 144 60 

gcgttgtgct tgtcggccgt tgctgtcgta atgatatcgg caccttcgac cgcctgttcc 14520 

gcagagatcc cgtgggcgaa gaactccagc atgagatccc cgcgctggag gatcatccag 14580 

ccggcgtccc ggaaaacgat tccgaagccc aacctttcat agaaggcggc ggtggaatcg 14 640 

aaatctcgtg atggcaggtt gggcgtcgct tggtcggtca tttcgaaccc cagagtcccg 14700 

ctcagaagaa ctcgtcaaga aggcgataga aggcgatgcg ctgcgaatcg ggagcggcga 14760 

taccgtaaag cacgaggaag cggtcagccc attcgccgcc aagctcttca gcaatatcac 14820 

gggtagccaa cgctatgtcc tgatagcggt ccgccacacc cagccggcca cagtcgatga 14880 

atccagaaaa gcggccattt tccaccatga tattcggcaa gcaggcatcg ccatgggtca 14 940 

cgacgagatc atcgccgtcg ggcatgcgcg ccttgagcct ggcgaacagt tcggctggcg 15000 

cgagcccctg atgctcttcg tccagatcat cctgatcgac aagaccggct tccatccgag 15060 

tacgtgctcg ctcgatgcga tgtttcgctt ggtggtcgaa tgggcaggta gccggatcaa 15120 

gcgtatgcag ccgccgcatt gcatcagcca tgatggatac tttctcggca ggagcaaggt 15180 

gagatgacag gagatcctgc cccggcactt cgcccaatag cagccagtcc cttcccgctt 15240 

cagtgacaac gtcgagcaca gctgcgcaag gaacgcccgt cgtggccagc cacgatagcc 15300 

gcgctgcctc gtcctgcagt tcattcaggg caccggacag gtcggtcttg acaaaaagaa 15360 

ccgggcgccc ctgcgctgac agccggaaca cggcggcatc agagcagccg attgtctgtt 15420 

gtgcccagtc atagccgaat agcctctcca cccaagcggc cggagaacct gcgtgcaatc 15480 



BASF AG 326/365 January 08, 2004 

BASF NAE 877/03 

catcttgttc aatcatgcga aacgatccag atccggtgca gattatttgg attgagagtg 15540 

aatatgagac tctaattgga taccgagggg aatttatgga acgtcagtgg agcatttttg 15600 

acaagaaata tttgctagct gatagtgacc ttaggcgact tttgaacgcg caataatggt 15660 

ttctgacgta tgtgcttagc tcattaaact ccagaaaccc gcggctgagt ggctccttca 15720 

acgttgcggt tctgtcagtt ccaaacgtaa aacggcttgt cccgcgtcat cggcgggggt 15780 

cataacgtga ctcccttaat tctccgctca tgatcagatt gtcgtttccc gccttcagtt 15840 

taaactatca gtgtttgaca ggatatattg gcgggtaaac ctaagagaaa agagcgttta 15900 

ttagaataat cggatattta aaagggcgtg aaaaggttta tccgttcgtc catttgtatg 15960 

tgcatgccaa ccacagggtt ccccagatct ggcgccggcc agcgagacga gcaagattgg 16020 

ccgccgcccg aaacgatccg acagcgcgcc cagcacaggt gcgcaggcaa attgcaccaa 16080 

cgcatacagc gccagcagaa tgccatagtg ggcggtgacg tcgttcgagt gaaccagatc 16140 

gcgcaggagg cccggcagca ccggcataat caggccgatg ccgacagcgt cgagcgcgac 16200 

agtgctcaga attacgatca ggggtatgtt gggtttcacg tctggcctcc ggaccagcct 16260 

ccgctggtcc gattgaacgc gcggattctt tatcactgat aagttggtgg acatattatg 16320 

tttatcagtg ataaagtgtc aagcatgaca aagttgcagc cgaatacagt gatccgtgcc 16380 

gccctggacc tgttgaacga ggtcggcgta gacggtctga cgacacgcaa actggcggaa 16440 

cggttggggg ttcagcagcc ggcgctttac tggcacttca ggaacaagcg ggcgctgctc 16500 

gacgcactgg ccgaagccat gctggcggag aatcatacgc attcggtgcc gagagccgac 16560 

gacgactggc gctcatttct gatcgggaat gcccgcagct tcaggcaggc gctgctcgcc 16620 

taccgcgatg gcgcgcgcat ccatgccggc acgcgaccgg gcgcaccgca gatggaaacg 16680 
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gccgacgcgc agcttcgctt cctctgcgag 
gcgctgatga caatcagcta cttcactgtt 
agcgatgccg gcgagcgcgg cggcaccgtt 
gccgcgatag acgccttcga cgaagccggt 
gtgattgtcg atggattggc gaaaaggagg 
aagggtgacg attgatcagg accgctgccg 
tgtagacaac atcccctccc cctttccacc 
gctttttcat gccctgccct agcgtccaag 
ccttctggcg ctcttccgct tcctcgctca 
ggcgagcggt atcagctcac tcaaaggcgg 
acgcaggaaa gaacatgtga gcaaaaggcc 
cgttgctggc gtttttccat aggctccgcc 
caagtcagag gtggcgaaa'c ccgacaggac 
gctccctcgt gcgctctcct gttccgaccc 
tcccttcggg aagcgtggcg cttttccgct 
attttttcgg tatatccatc ctttttcgca 
gtagactttc cttggtgtat ccaacggcgt 
cccgcgagcg ggtgttcctt cttcactgtc 
aatcctgctc tgcgaggctg gccggctacc 
gctgatgaaa ccaagccaac caggaagggc 
gacgaacgaa gagcgattga ggaaaaggcg 
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gcgggttttt cggccgggga cgccgtcaat 16740 

ggggccgtgc ttgaggagca ggccggcgac 16800 

gaacaggctc cgctctcgcc gctgttgcgg 16860 

ccggacgcag cgttcgagca gggactcgcg 16920 

ctcgttgtca ggaacgttga aggaccgaga 16980 

gagcgcaacc cactcactac agcagagcca 17040 

gcgtcagacg cccgtagcag cccgctacgg 17100 

cctcacggcc gcgctcggcc tctctggcgg 17160 

ctgactcgct gcgctcggtc gttcggctgc 17220 

taatacggtt atccacagaa tcaggggata 17280 

agcaaaaggc caggaaccgt aaaaaggccg 17340 

cccctgacga gcatcacaaa aatcgacgct 17400 

tataaagata ccaggcgttt ccccctggaa 174 60' 

tgccgcttac cggatacctg tccgcctttc 17520 

gcataaccct gcttcggggt cattatagcg 17580 

cgatatacag gattttgcca aagggttcgt 17640 

cagccgggca ggataggtga agtaggccca 17700 

ccttattcgc acctggcggt gctcaacggg 17760 

gccggcgtaa cagatgaggg caagcggatg 17820 

agcccaccta tcaaggtgta ctgccttcca 17880 

gcggcggccg gcatgagcct gtcggcctac 17940 
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ctgctggccg tcggccaggg ctacaaaatc acgggcgtcg tggactatga gcacgtccgc 18000 

gagctggccc gcatcaatgg cgacctgggc cgcctgggcg gcctgctgaa actctggctc 18060 

accgacgacc cgcgcacggc gcggttcggt gatgccacga tcctcgccct gctggcgaag 18120 

atcgaagaga agcaggacga gcttggcaag gtcatgatgg gcgtggtccg cccgagggca 18180 

gagccatgac ttttttagcc gctaaaacgg ccggggggtg cgcgtgattg ccaagcacgt 18240 

ccccatgcgc tccatcaaga agagcgactt cgcggagctg gtgaagtaca tcaccgacga 18300 

gcaaggcaag accgagcgcc tttgcgacgc tea 18333 

<210> 52 

<211> 17 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 



<220> 

<221> misc_f eature 

<222> (3) . . (3) 

<223> n is a, c, g, or t 

<220> 

<221> misc__f eature 

<222> (9) . . (9) 

<223> n is a, c, g, or t 



<400> 



52 



gengarggna thtggta 



17 



<210> 53 
<211> 20 
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<212> DNA 

<213> Artificial 

<220> 

<223> Primer 



<220> 

<221> misc_feature 

<222> (3) . . (3) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (6) . . (6) 

<223> n is a, c, g, or t 

<400> 53 

tcngcnagra adatrttrtg 20 

<210> 54 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 54 

aagtgacacc ggttacacgc ttgtctt 27 



<210> 55 

<211> 27 

<212> DNA 

<213> Artificial 



<220> 
<223> 



Primer 
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<400> 55 



gcttatcacc atctgttacc tccttgc 



27 



<210> 56 

<211> 32 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 56 

agagagggat ccttaaatgc gaatatcgtt gc 32 

<210> 57 

<211> 32 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 



<210> 58 

<211> 37 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 58 

actttattgg atccttaaat gcgaatatcg ttgctgc 37 



<400> 



57 



agagagggat ccatgtctga tcaaaagaag ca 



32 



<210> 59 
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<211> 38 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 59 

gttccaattg gccacatgaa gagtaagaca ggaaacag 



<210> 60 

<211> 38 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 60 

cctgtcttac tcttcatgtg gccaattgga accaacac 



<210> 61 

<211> 38 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 61 

ctattttaat catatgtctg atcaaaagaa gcatattg 



<210> 62 

<211> 16103 

<212> DNA 

<213> Artificial 



<220> 
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<223> Primer 



<220> 

<221> misc_f eature 

<222> (3471) . . (3471) 

<223> n is a, c, g, or t 

<220> 

<221> misc_f eature 

<222> (3679) . . (3679) 

<223> n is a, c, q, or t 

<220> 

<221> mi sc_f eature 

<222> (3770) . . (3770) 

<223> n is a, c, g, or t 

<400> 62 

gatctttcga cactgaaata cgtcgagcct gctccgcttg gaagcggcga ggagcctcgt 60 

cctgtcacaa ctaccaacat ggagtacgat aagggccagt tccgccagct cattaagagc 120 

cagttcatgg gcgttggcat gatggccgtc atgcatctgt acttcaagta caccaacgct 180 

cttctgatcc agtcgatcat ccgctgaagg cgctttcgaa tctggttaag atccacgtct 240 

tcgggaagcc agcgactggt gacctccagc gtccctttaa ggctgccaac agctttctca 300 

gccagggcca gcccaagacc gacaaggcct ccctccagaa cgccgagaag aactggaggg 360 

gtggtgtcaa ggaggagtaa gctccttatt gaagtcggag gacggagcgg tgtcaagagg 420 

atattcttcg actctgtatt atagataaga tgatgaggaa ttggaggtag catagcttca 480 

tttggatttg ctttccaggc tgagactcta gcttggagca tagagggtcc tttggctttc 540 

aatattctca agtatctcga gtttgaactt attccctgtg aaccttttat tcaccaatga 600 



gcattggaat gaacatgaat ctgaggactg caatcgccat gaggttttcg aaatacatcc 660 
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ggatgtcgaa ggcttggggc acctgcgttg gttgaattta gaacgtggca ctattgatca 720 

tccgatagct ctgcaaaggg cgttgcacaa tgcaagtcaa acgttgctag cagttccagg 780 

tggaatgtta tgatgagcat tgtattaaat caggagatat agcatgatct ctagttagct 840 

caccacaaaa gtcagacggc gtaaccaaaa gtcacacaac acaagctgta aggatttcgg 900 

cacggctacg gaagacggag aagccacctt cagtggactc gagtaccatt taattctatt 960 

tgtgtttgat cgagacctaa tacagcccct acaacgacca tcaaagtcgt atagctacca 1020 

gtgaggaagt ggactcaaat cgacttcagc aacatctcct ggataaactt taagcctaaa 1080 

ctatacagaa taagataggt ggagagctta taccgagctc ccaaatctgt ccagatcatg 1140 

gttgaccggt gcctggatct tcctatagaa tcatccttat tcgttgacct agctgattct 1200 

ggagtgaccc agagggtcat gacttgagcc taaaatccgc cgcctccacc atttgtagaa 1260 

aaatgtgacg aactcgtgag ctctgtacag tgaccggtga ctctttctgg catgcggaga 1320 

gacggacgga cgcagagaga agggctgagt aataagccac tggccagaca gctctggcgg 1380 

ctctgaggtg cagtggatga ttattaatcc gggaccggcc gcccctccgc cccgaagtgg 14 40' 

aaaggctggt gtgcccctcg ttgaccaaga atctattgca tcatcggaga atatggagct 1500 

tcatcgaatc accggcagta agcgaaggag aatgtgaagc caggggtgta tagccgtcgg 1560 

cgaaatagca tgccattaac ctaggtacag aagtccaatt gcttccgatc tggtaaaaga 1620 

ttcacgagat agtaccttct ccgaagtagg tagagcgagt acccggcgcg taagctccct 1680 

aattggccca tccggcatct gtagggcgtc caaatatcgt gcctctcctg ctttgcccgg 1740 

tgtatgaaac cggaaaggcc gctcaggagc tggccagcgg cgcagaccgg gaacacaagc 1800 

tggcagtcga cccatccggt gctctgcact cgacctgctg aggtccctca gtccctggta 1860 

ggcagctttg ccccgtctgt ccgcccggtg tgtcggcggg gttgacaagg tcgttgcgtc 1920 
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agtccaacat ttgttgccat attttcctgc tctccccacc agctgctctt ttcttttctc 1980 

tttcttttcc catcttcagt atattcatct tcccatccaa gaacctttat ttcccctaag 2040 

taagtacttt gctacatcca tactccatcc ttcccatccc ttattccttt gaacctttca 2100 

gttcgagctt tcccacttca tcgcagcttg actaacagct accccgcttg agcagacatc 2160 

accatgcctg aactcaccgc gacgtctgtc gagaagtttc tgatcgaaaa gttcgacagc 2220 

gtctccgacc tgatgcagct ctcggagggc gaagaatctc gtgctttcag cttcgatgta 2280 

ggagggcgtg gatatgtcct gcgggtaaat agctgcgccg atggtttcta caaagatcgt 2340 

tatgtttatc ggcactttgc atcggccgcg ctcccgattc cggaagtgct tgacattggg 2400 

gaattcagcg agagcctgac ctattgcatc tcccgccgtg cacagggtgt cacgttgcaa 24 60 

gacctgcctg aaaccgaact gcccgctgtt ctgcagccgg tcgcggaggc catggatgcg 2520 

atcgctgcgg ccgatcttag ccagacgagc gggttcggcc cattcggacc gcaaggaatc 2580 

ggtcaataca ctacatggcg tgatttcata tgcgcgattg ctgatcccca tgtgtatcac 2640 

tggcaaactg tgatggacga caccgtcagt gcgtccgtcg cgcaggctct cgatgagctg 2700 

atgctttggg ccgaggactg ccccgaagtc cggcacctcg tgcacgcgga tttcggctcc 2760 

aacaatgtcc tgacggacaa tggccgcata acagcggtca ttgactggag cgaggcgatg 2820 

ttcggggatt cccaatacga ggtcgccaac atcttcttct ggaggccgtg gttggcttgt 2880 

atggagcagc agacgcgcta cttcgagcgg aggcatccgg agcttgcagg atcgccgcgg 294 0 

ctccgggcgt atatgctccg cattggtctt gaccaactct atcagagctt ggttgacggc 3000 

aatttcgatg atgcagcttg ggcgcagggt cgatgcgacg caatcgtccg atccggagcc 3060 

gggactgtcg ggcgtacaca aatcgcccgc agaagcgcgg ccgtctggac cgatggctgt 3120 
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gtagaagtac tcgccgatag tggaaaccga cgccccagca ctcgtccgag ggcaaaggaa 3180 

tagagtagat gccgaccgcg ggatcgatcc acttaacgtt actgaaatca tcaaacagct 3240 

tgacgaatct ggatataaga tcgttggtgt cgatgtcagc tccggagttg agacaaatgg 3300 

tgttcaggat ctcgataaga tacgttcatt tgtccaagca gcaaagagtg ccttctagtg 3360 

atttaatagc tccatgtcaa caagaataaa acgcg.ttttc gggtttacct cttccagata 3420 

cagctcatct gcaatgcatt aatgcattga ctgcaaccta gtaacgcctt ncaggctccg 3480 

gcgaagagaa gaatagctta gcagagctat tttcattttc gggagacgag atcaagcaga 3540 

tcaacggtcg tcaagagacc tacgagactg aggaatccgc tcttggctcc acgcgactat 3600 

atatttgtct ctaattgtac tttgacatgc tcctcttctt tactctgata gcttgactat 3660 

gaaaattccg tcaccagcnc ctgggttcgc aaagataatt gcatgtttct tccttgaact 3720 

ctcaagccta caggacacac attcatcgta ggtataaacc tcgaaatcan ttcctactaa 3780 

gatggtatac aatagtaacc atgcatggtt gcctagtgaa tgctccgtaa cacccaatac 3840 

gccggccgaa acttttttac aactctccta tgagtcgttt acccagaatg cacaggtaca 3900 

cttgtttaga ggtaatcctt ctttctagct agaagtcctc gtgtactgtg taagcgccca 3960 

ctccacatct ccactcgacc tgcaggcatg caagcttgag tctatcgcct ccaaaaagta 4020 

cggtgctgaa ttcagatatc aatcgcctgt tgctaaaatt aacactgtcg ataaagacaa 4080 

gcgtgtaacc ggtgtcactt tggaaagcgg agaagtcatt gaagccgatg cagtcgtatg 4140 

taatgcggat cttgtttatg cttatcacca tctgttacct ccttgcaatt ggacaaagaa 4200 

gacattagcc tcaaagaaac tcacttcatc atctatttcg ttttattggt ccatgtcaac 4260 

aaaggtgcct caattagacg tacacaatat cttcttggct gaagcctaca aggaaagttt 4320 

tgatgagatt ttcaacgact tcggtttgcc ctctgaagct tggcgtaatc atggtcatag 4380 
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ctgtttcctg tgtgaaattg ttatccgctc acaattccac acaacatacg agccggaagc 44 40 

ataaagtgta aagcctgggg tgcctaatga gtgagctaac tcacattaat tgcgttgcgc 4500 

tcactgcccg ctttccagtc gggaaacctg tcgtgccagc tgcattaatg aatcggccaa 4560 

cgcgcgggga gaggcggttt gcgtattggg ccaaagacaa aagggcgaca ttcaaccgat 4 620 

tgagggaggg aaggtaaata ttgacggaaa ttattcatta aaggtgaatt atcaccgtca 4 680 

ccgacttgag ccatttggga attagagcca gcaaaatcac cagtagcacc attaccatta 4740 

gcaaggccgg aaacgtcacc aatgaaacca tcgatagcag caccgtaatc agtagcgaca 4800 

gaatcaagtt tgcctttagc gtcagactgt agcgcgtttt catcggcatt ttcggtcata 4860 

gcccccttat tagcgtttgc catcttttca taatcaaaat caccggaacc agagccacca 4 920 

ccggaaccgc ctccctcaga gccgccaccc tcagaaccgc caccctcaga gccaccaccc 4 980 

tcagagccgc caccagaacc accaccagag ccgccgccag cattgacagg aggcccgatc 5040 

tagtaacata gatgacaccg cgcgcgataa tttatcctag tttgcgcgct atattttgtt 5100 

ttctatcgcg tattaaatgt ataattgcgg gactctaatc ataaaaaccc atctcataaa 5160 

taacgtcatg cattacatgt taattattac atgcttaacg taattcaaca gaaattatat 5220 

gataatcatc gcaagaccgg caacaggatt caatcttaag aaactttatt gccaaatgtt 5280 

tgaacgatcg gggatcatcc gggtctgtgg cgggaactcc acgaaaatat ccgaacgcag 5340 

caagatatcg cggtgcatct cggtcttgcc tgggcagtcg ccgccgacgc cgttgatgtg 5400 

gacgccgggc ccgatcatat tgtcgctcag gatcgtggcg ttgtgcttgt cggccgttgc 5460 

tgtcgtaatg atatcggcac cttcgaccgc ctgttccgca gagatcccgt gggcgaagaa 5520 

ctccagcatg agatccccgc gctggaggat catccagccg gcgtcccgga aaacgattcc 5580 
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gaagcccaac ctttcataga aggcggcggt ggaatcgaaa tctcgtgatg gcaggttggg 5640 

cgtcgcttgg tcggtcattt cgaaccccag agtcccgctc agaagaactc gtcaagaagg 5700 

cgatagaagg cgatgcgctg cgaatcggga gcggcgatac cgtaaagcac gaggaagcgg 5760 

tcagcccatt cgccgccaag ctcttcagca atatcacggg tagccaacgc tatgtcctga 5820 

tagcggtccg ccacacccag ccggccacag tcgatgaatc cagaaaagcg gccattttcc 5880 

accatgatat tcggcaagca ggcatcgcca tgggtcacga cgagatcatc gccgtcgggc 5940 

atgcgcgcct tgagcctggc gaacagttcg gctggcgcga gcccctgatg ctcttcgtcc 6000 

agatcatcct gatcgacaag accggcttcc atccgagtac gtgctcgctc gatgcgatgt 6060 

ttcgcttggt ggtcgaatgg gcaggtagcc ggatcaagcg tatgcagccg ccgcattgca 6120 

tcagccatga tggatacttt ctcggcagga gcaaggtgag atgacaggag atcctgcccc 6180 

ggcacttcgc ccaatagcag ccagtccctt cccgcttcag tgacaacgtc gagcacagct 6240 

gcgcaaggaa cgcccgtcgt ggccagccac gatagccgcg ctgcctcgtc ctgcagttca 6300 

ttcagggcac cggacaggtrc ggtcttgaca aaaagaaccg ggcgcccctg cgctgacagc 6360 

cggaacacgg cggcatcaga gcagccgatt gtctgttgtg cccagtcata gccgaatagc 6420 

ctctccaccc aagcggccgg agaacctgcg tgcaatccat cttgttcaat catgcgaaac 6480 

gatccagatc cggtgcagat tatttggatt gagagtgaat atgagactct aattggatac 6540 

cgaggggaat ttatggaacg tcagtggagc atttttgaca agaaatattt gctagctgat 6600 

agtgacctta ggcgactttt gaacgcgcaa taatggtttc tgacgtatgt gcttagctca 6660 

ttaaactcca gaaacccgcg gctgagtggc tccttcaacg ttgcggttct gtcagttcca 6720 

aacgtaaaac ggcttgtccc gcgtcatcgg cgggggtcat aacgtgactc ccttaattct 6780 

ccgctcatga tcagattgtc gtttcccgcc ttcagtttaa actatcagtg tttgacagga 6840 
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tatattggcg ggtaaaccta agagaaaaga gcgtttatta gaataatcgg atatttaaaa 6900 

gggcgtgaaa aggtttatcc gttcgtccat ttgtatgtgc atgccaacca cagggttccc 6960 

cagatctggc gccggccagc gagacgagca agattggccg ccgcccgaaa cgatccgaca 7020 

gcgcgcccag cacaggtgcg caggcaaatt gcaccaacgc atacagcgcc agcagaatgc 7080 

catagtgggc ggtgacgtcg ttcgagtgaa ccagatcgcg caggaggccc ggcagcaccg 714 0 

gcataatcag gccgatgccg acagcgtcga gcgcgacagt gctcagaatt acgatcaggg 7200 

gtatgttggg tttcacgtct ggcctccgga ccagcctccg ctggtccgat tgaacgcgcg 7260 

gattctttat cactgataag ttggtggaca tattatgttt atcagtgata aagtgtcaag 7320 

catgacaaag ttgcagccga atacagtgat ccgtgccgcc ctggacctgt tgaacgaggt 7380 

cggcgtagac ggtctgacga cacgcaaact ggcggaacgg ttgggggttc agcagccggc 7440 

gctttactgg cacttcagga acaagcgggc gctgctcgac gcactggccg aagccatgct 7 500 

ggcggagaat catacgcatt cggtgccgag agccgacgac gactggcgct catttctgat 7560 

cgggaatgcc cgcagcttca ggcaggcgct gctcgcctac cgcgatggcg cgcgcatcca 7 620 

tgccggcacg cgaccgggcg caccgcagat ggaaacggcc gacgcgcagc ttcgcttcct 7 680 

ctgcgaggcg ggtttttcgg ccggggacgc cgtcaatgcg ctgatgacaa tcagctactt 7740 

cactgttggg gccgtgcttg aggagcaggc cggcgacagc gatgccggcg agcgcggcgg 7800 

caccgttgaa caggctccgc tctcgccgct gttgcgggcc gcgatagacg ccttcgacga 7860 

agccggtccg gacgcagcgt tcgagcaggg actcgcggtg attgtcgatg gattggcgaa 7 920 

aaggaggctc gttgtcagga acgttgaagg accgagaaag ggtgacgatt gatcaggacc 7980 

gctgccggag cgcaacccac tcactacagc agagccatgt agacaacatc ccctccccct 8040 
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ttccaccgcg tcagacgccc gtagcagccc gctacgggct ttttcatgcc ctgccctagc 8100 

gtccaagcct cacggccgcg ctcggcctct ctggcggcct tctggcgctc ttccgcttcc 8160 

tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca 8220 

aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca 8280 

aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg 834 0 

ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg 8400 

acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt 8460 

ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt 8520 

ttccgctgca taaccctgct tcggggtcat tatagcgatt ttttcggtat atccatcctt 8580 

tttcgcacga tatacaggat tttgccaaag ggttcgtgta gactttcctt ggtgtatcca 8640 

acggcgtcag ccgggcagga taggtgaagt aggcccaccc gcgagcgggt gttccttctt 8700 

cactgtccct tattcgcacc tggcggtgct caacgggaat cctgctctgc gaggctggcc 8760 

ggctaccgcc ggcgtaacag atgagggcaa gcggatggct gatgaaacca agccaaccag 8820' 

gaagggcagc ccacctatca aggtgtactg ccttccagac gaacgaagag cgattgagga 8880 

aaaggcggcg gcggccggca tgagcctgtc ggcctacctg ctggccgtcg gccagggcta 8940 

caaaatcacg ggcgtcgtgg actatgagca cgtccgcgag ctggcccgca tcaatggcga 9000 

cctgggccgc ctgggcggcc tgctgaaact ctggctcacc gacgacccgc gcacggcgcg 9060 

gttcggtgat gccacgatcc tcgccctgct ggcgaagatc gaagagaagc aggacgagct 9120 

tggcaaggtc atgatgggcg tggtccgccc gagggcagag ccatgacttt tttagccgct 9180 

aaaacggccg gggggtgcgc gtgattgcca agcacgtccc catgcgctcc atcaagaaga 9240 

gcgacttcgc ggagctggtg aagtacatca ccgacgagca aggcaagacc gagcgccttt 9300 
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gcgacgctca ccgggctggt tgccctcgcc gctgggctgg cggccgtcta tggccctgca 9360 

aacgcgccag aaacgccgtc gaagccgtgt gcgagacacc gcggccgccg gcgttgtgga 9420 

tacctcgcgg aaaacttggc cctcactgac agatgagggg cggacgttga cacttgaggg 9480 

gccgactcac ccggcgcggc gttgacagat gaggggcagg ctcgatttcg gccggcgacg 9540 

tggagctggc cagcctcgca aatcggcgaa aacgcctgat tttacgcgag tttcccacag 9600 

atgatgtgga caagcctggg gataagtgcc ctgcggtatt gacacttgag gggcgcgact 9660 

actgacagat gaggggcgcg atccttgaca cttgaggggc agagtgctga cagatgaggg 9720 

gcgcacctat tgacatttga ggggctgtcc acaggcagaa aatccagcat ttgcaagggt 9780 

ttccgcccgt ttttcggcca ccgctaacct gtcttttaac ctgcttttaa accaatattt 9840 

ataaaccttg tttttaacca gggctgcgcc ctgtgcgcgt gaccgcgcac gccgaagggg 9900 

ggtgcccccc cttctcgaac cctcccggcc cgctaacgcg ggcctcccat ccccccaggg 9960 

gctgcgcccc tcggccgcga acggcctcac cccaaaaatg gcagcgctgg cagtccttgc 10020 

cattgccggg atcggggcag taacgggatg ggcgatcagc ccgagcgcga cgcccggaag 10080 

cattgacgtg ccgcaggtgc tggcatcgac attcagcgac caggtgccgg gcagtgaggg 10140 

cggcggcctg ggtggcggcc tgcccttcac ttcggccgtc ggggcattca cggacttcat 10200 

ggcggggccg gcaattttta ccttgggcat tcttggcata gtggtcgcgg gtgccgtgct 10260 

cgtgttcggg ggtgcgataa acccagcgaa ccatttgagg tgataggtaa gattataccg 10320 

aggtatgaaa acgagaattg gacctttaca gaattactct atgaagcgcc atatttaaaa 10380 

agctaccaag acgaagagga tgaagaggat gaggaggcag attgccttga atatattgac 10440 

aatactgata agataatata tcttttatat agaagatatc gccgtatgta aggatttcag 10500 
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ggggcaaggc ataggcagcg cgcttatcaa tatatctata gaatgggcaa agcataaaaa 10560 

cttgcatgga ctaatgcttg aaacccagga caataacctt atagcttgta aattctatca 10620 

taattgggta atgactccaa cttattgata gtgttttatg ttcagataat gcccgatgac 10680 

tttgtcatgc agctccaccg attttgagaa cgacagcgac ttccgtccca gccgtgccag 10740 

gtgctgcctc agattcaggt tatgccgctc aattcgctgc gtatatcgct tgctgattac 10800 

gtgcagcttt cccttcaggc gggattcata cagcggccag ccatccgtca tccatatcac 10860 

cacgtcaaag ggtgacagca ggctcataag acgccccagc gtcgccatag tgcgttcacc 10920 

gaatacgtgc gcaacaaccg tcttccggag actgtcatac gcgtaaaaca gccagcgctg 10980 

gcgcgattta gccccgacat agccccactg ttcgtccatt tccgcgcaga cgatgacgtc 11040 

actgcccggc tgtatgcgcg aggttaccga ctgcggcctg agttttttaa gtgacgtaaa 11100 

atcgtgttga ggccaacgcc cataatgcgg gctgttgccc ggcatccaac gccattcatg 11160 

gccatatcaa tgattttctg gtgcgtaccg ggttgagaag cggtgtaagt gaactgcagt 11220 

tgccatgttt tacggcagtg agagcagaga tagcgctgat gtccggcggt gcttttgccg 11280 

ttacgcacca ccccgtcagt agctgaacag gagggacagc tgatagacac agaagccact 11340 

ggagcacctc aaaaacacca tcatacacta aatcagtaag ttggcagcat cacccataat 11400 

tgtggtttca aaatcggctc cgtcgatact atgttatacg ccaactttga aaacaacttt 11460 

gaaaaagctg ttttctggta tttaaggttt tagaatgcaa ggaacagtga attggagttc 11520 

gtcttgttat aattagcttc ttggggtatc tttaaatact gtagaaaaga ggaaggaaat 11580 

aataaatggc taaaatgaga atatcaccgg aattgaaaaa actgatcgaa aaataccgct 11640 

gcgtaaaaga tacggaagga atgtctccrg ctaaggtata taagctggtg ggagaaaatg 11700 

aaaacctata tttaaaaatg acggacagcc ggtataaagg gaccacctat gatgtggaac 11760 
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gggaaaagga catgatgcta tggctggaag gaaagctgcc tgttccaaag gtcctgcact 

ttgaacggca tgatggctgg agcaatctgc tcatgagtga ggccgatggc gtcctttgct 

cggaagagta tgaagatgaa caaagccctg aaaagattat cgagctgtat gcggagtgca 

tcaggctctt tcactccatc gacatatcgg attgtcccta tacgaatagc ttagacagcc 

gcttagccga attggattac ttactgaata acgatctggc cgatgtggat tgcgaaaact 

gggaagaaga cactccattt aaagatccgc gcgagctgta tgatttttta aagacggaaa 

agcccgaaga ggaacttgtc ttttcccacg gcgacctggg agacagcaac atctttgtga 

• aagatggcaa agtaagtggc tttattgatc ttgggagaag cggcagggcg gacaagtggt 

atgacattgc cttctgcgtc cggtcgatca gggaggatat cggggaagaa cagtatgtcg 

agctattttt tgacttactg gggatcaagc ctgattggga gaaaataaaa tattatattt 

tactggatga attgttttag tacctagatg tggcgcaacg atgccggcga caagcaggag 

cgcaccgact tcttccgcat caagtgtttt ggctctcagg ccgaggccca cggcaagtat 

ttgggcaagg ggtcgctggt attcgtgcag ggcaagattc ggaataccaa gtacgagaag 

gacggccaga cggtctacgg gaccgacttc attgccgata aggtggatta tctggacacc 

aaggcaccag gcgggtcaaa tcaggaataa gggcacattg ccccggcgtg agtcggggca 

atcccgcaag gagggtgaat gaatcggacg tttgaccgga aggcatacag gcaagaactg 

atcgacgcgg ggttttccgc cgaggatgcc gaaaccatcg caagccgcac cgtcatgcgt 

gcgccccgcg aaaccttcca gtccgtcggc tcgatggtcc agcaagctac ggccaagatc 

gagcgcgaca gcgtgcaact ggctccccct gccctgcccg cgccatcggc cgccgtggag* 

cgttcgcgtc gtctcgaaca ggaggcggca ggtttggcga agtcgatgac catcgacacg 



11820 
11880 
11940 
12000 
12060 
12120 
12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 
12780 
12840 
12900 
12960 
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cgaggaacta tgacgaccaa gaagcgaaaa accgccggcg aggacctggc aaaacaggtc 13020 

agcgaggcca agcaggccgc gttgctgaaa cacacgaagc agcagatcaa ggaaatgcag 13080 

ctttccttgt tcgatattgc gccgtggccg gacacgatgc gagcgatgcc aaacgacacg 13140 

gcccgctctg ccctgttcac cacgcgcaac aagaaaatcc cgcgcgaggc gctgcaaaac 13200 

aaggtcattt tccacgtcaa caaggacgtg aagatcacct acaccggcgt cgagctgcgg 13260 

gccgacgatg acgaactggt gtggcagcag gtgttggagt acgcgaagcg cacccctatc 13320 

ggcgagccga tcaccttcac gttctacgag ctttgccagg acctgggctg gtcgatcaat 13380 

ggccggtatt acacgaaggc cgaggaatgc ctgtcgcgcc tacaggcgac ggcgatgggc 13440 

ttcacgtccg accgcgttgg gcacctggaa tcggtgtcgc tgctgcaccg cttccgcgtc 13500 

ctggaccgtg gcaagaaaac gtcccgtcgc caggtcctga tcgacgagga aatcgtcgtg 13560 

ctgtttgctg gcgaccacta cacgaaattc atatgggaga agtaccgcaa gctgtcgccg 13620 

acggcccgac ggatgttcga ctatttcagc tcgcaccggg agccgtaccc gctcaagctg 13680 

gaaaccttcc gcctcatgtrg cggatcggat tccacccgcg tgaagaagtg gcgcgagcag 13740 

gtcggcgaag cctgcgaaga gttgcgaggc agcggcctgg tggaacacgc ctgggtcaat 13800 

gatgacctgg tgcattgcaa acgctagggc cttgtggggt cagttccggc tgggggttca 13860 

gcagccagcg ctttactggc atttcaggaa caagcgggca ctgctcgacg cacttgcttc 13920 

gctcagtatc gctcgggacg cacggcgcgc tctacgaact gccgataaac agaggattaa 13980 

aattgacaat tgtgattaag gctcagattc gacggcttgg agcggccgac gtgcaggatt 14040 

tccgcgagat ccgattgtcg gccctgaaga aagctccaga gatgttcggg tccgtttacg 14100 

agcacgagga gaaaaagccc atggaggcgt tcgctgaacg gttgcgagat gccgtggcat 14160 

tcggcgccta catcgacggc gagatcattg ggctgtcggt cttcaaacag gaggacggcc 14220 
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ccaaggacgc tcacaaggcg catctgtccg gcgttttcgt ggagcccgaa cagcgaggcc 14280 

gaggggtcgc cggtatgctg ctgcgggcgt tgccggcggg tttattgctc gtgatgatcg 14340 

tccgacagat tccaacggga atctggtgga tgcgcatctt catcctcggc gcacttaata 14400 

tttcgctatt ctggagcttg ttgtttattt cggtctaccg cctgccgggc ggggtcgcgg 14460 

cgacggtagg cgctgtgcag ccgctgatgg tcgtgttcat ctctgccgct ctgctaggta 14520 

gcccgatacg attgatggcg gtcctggggg ctatttgcgg aactgcgggc gtggcgctgt 14580 

tggtgttgac accaaacgca gcgctagatc ctgtcggcgt cgcagcgggc ctggcggggg 14640 

cggtttccat ggcgttcgga accgtgctga cccgcaagtg gcaacctccc gtgcctctgc 14700 

tcacctttac cgcctggcaa ctggcggccg gaggacttct gctcgttcca gtagctttag 14760 

tgtttgatcc gccaatcccg atgcctacag gaaccaatgt tctcggcctg gcgtggctcg 14820 

gcctgatcgg agcgggttta acctacttcc tttggttccg ggggatctcg cgactcgaac 14880 

ctacagttgt ttccttactg ggctttctca gccccagatc tggggtcgat cagccgggga 14940 

tgcatcaggc cgacagtcgg aacttcgggt ccccgacctg taccattcgg tgagcaatgg 15000 

ataggggagt tgatatcgtc aacgttcact tctaaagaaa tagcgccact cagcttcctc 15060 

agcggcttta tccagcgatt tcctattatg tcggcatagt tctcaagatc gacagcctgt 15120 

cacggttaag cgagaaatga ataagaaggc tgataattcg gatctctgcg agggagatga 15180 

tatttgatca caggcagcaa cgctctgtca tcgttacaat caacatgcta ccctccgcga 15240 

gatcatccgt gtttcaaacc cggcagctta gttgccgttc ttccgaatag catcggtaac 15300 

atgagcaaag tctgccgcct tacaacggct ctcccgctga cgccgtcccg gactgatggg 15360 ' 

ctgcctgtat cgagtggtga ttttgtgccg agctgccggt cggggagctg ttggctggct 15420 
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ggtggcagga tatattgtgg tgtaaacaaa ttgacgctta gacaacttaa taacacattg 15480 

cggacgtttt taatgtactg gggtggtttt tcttttcacc agtgagacgg gcaacagctg 15540 

attgcccttc accgcctggc cctgagagag ttgcagcaag cggtccacgc tggtttgccc 15600 

cagcaggcga aaatcctgtt tgatggtggt tccgaaatcg gcaaaatccc ttataaatca 15660 

aaagaatagc ccgagatagg gttgagtgtt gttccagttt ggaacaagag tccactatta 15720 

aagaacgtgg actccaacgt caaagggcga aaaaccgtct atcagggcga tggcccacta 15780 

cgtgaaccat cacccaaatc aagttttttg gggtcgaggt gccgtaaagc actaaatcgg 15840 

aaccctaaag ggagcccccg atttagagct tgacggggaa agccggcgaa cgtggcgaga 15900 

aaggaaggga agaaagcgaa aggagcgggc gccattcagg ctgcgcaact gttgggaagg 15960 

gcgatcggtg cgggcctctt cgctattacg ccagctggcg aaagggggat gtgctgcaag 16020 

gcgattaagt tgggtaacgc cagggttttc ccagtcacga cgttgtaaaa cgacggccag 16080 

tgaattcgag ctcggtaccc ggg 16103 

<210> 63 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 
<400> 63 

ggcgtacttg aaggaaccct taccg 25 

<210> 64 

<211> 25 

<212> DNA 

<213> Artificial 
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<220> 

<223> Primer 
<400> 64 

attgatgctc ccggtcaccg tgatt 25 



<210> 65 
<211> 500 
<212> DNA 

<213> Blakeslea trispora 
<400> 65 

aatctataca atgctccata gactcacatt gatattgtcg aagatttcga tgctgactta 60 
gtagagcaac tacaaaagtt agcagagaag catgatttct taatctttga agaccgcaag 120 
tttgcagata tcggtatgtg aattctatct attttttttc tgatgtgtgc atggatgact 180 
catgatcata ttcttaggta atactgtcaa gcatcaatat ggcaagggcg tttacaagat 240 
tgcttcttgg tctcatatta ctaatgctca cacagttcct ggagaaggta ttatcaaggg 300 
acttgccgaa gtcggcctcc ctcttggtcg tggcttgctt ttgctagcag aaatgtcatc 360 
tcaaggtgca ttaactaagg gtatttacac tgccgaatct gtcaatatgg ctcgccgcaa 420 
caaagatttc gtttttggct ttattgcaca acacaaaatg aatcagtatg atgatgagga 480 
ttttgttgtc atgtcgcctg 500 



<210> 66 
<211> 611 
<212> DNA 

<213> Blakeslea trispora 
<400> 66 

gagattaaaa tagataagga aaagaaagtg aaaagaaatt cggaagcatg gcacattctt 60 
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ctttttataa atacatgcct gactttcttt ttccatcgat atgatatatg catatgatag 120 

atatacaagc aatcttcttc aaggagtttg aaattttgtc ctccaggagc aaaaaaaagt 180 

ttttttttat acatgtttgt acacaagaat agttaccaat ttgctttggt cttacgtgct 240 

gcaagtttat atcgttttca atttctttgt ctttacattt tctttgtcct ttatctttcc 300 

tcatttagtc tttgggagaa ttaggaaaag ggagcggaaa ggtaagaaat gcttgcgtat 360 

tttactaatt cggcaaacat ccaatttggc aaacagcagc ctgtgcaacg ctctcgagat 420 

gacagtatct ttgattacac tctaaatctc gatgacccga ccaaaaagag cgaacaaaga 480 

aataatcttg tgcattcgaa tatgatggaa gattttttcc cccttattct aaatgttgac 540 

atagcgtgta tgttatataa acaaaaagaa attgtacaaa ctttcttttc ttctcttttt 600 

attttatctc t 611 

<210> 67 
<211> 720 
<212> DNA 

<213> Blakeslea trispora 
<400> 67 

atgtcaatac tcacttatct ggaatttcat ctctactata cactacctgt ccttgcggca 60 

ttgtgttggc tgctaaagcc gtttcactca cagcaagaca atctcaagta taaattttta 120 

atgttgatgg ccgcctctac cgcatcgatt tgggacaatt atatcgttta tcatcgcgct 180 

tggtggtact gtcctacttg tgttgtggct gtcattggct atgtacctct agaagaatac 240 

atgttcttta tcatcatgac tttaatgact gtcgcgttct caaactttgt tatgcgttgg 300 

cacttgcata ctttctttat tagacccaac acttcttgga agcaaacact attagtacgc 360 

cttgtgcctg tttcagcttt attggcaatc acttatcatg cttggcactt gacactgcca 420 
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aataaacctt cattttatgg ttcatgcatc ctttggtatg cttgtcctgt gttggctatt 480 

ctttggctgg gtgctggcga atatatcttg cgtcgacctg tggctgtcct tttgtctatt 540 

gttatcccta gtgtatacct atgttgggct gatatcgtcg ctattagtgc tggcacatgg 600 

catatttctc ttagaacaag cactggcaaa atggtagtac ccgatttacc tgtagaagaa 660 

tgcctgtttt ttactttgat caacacagtc ttggtttttg ctacctgtgc tatagaccgc 720 

<210> 68 
<211> 1089 
<212> DNA 

<213> Blakeslea trispora 
<400> 68 

ctgtacaaat catctgttca aaatcaaaac cctaaacaag ccatttccct tttccagcat 60 

gtcaaagagc tagcatgggc cttctgtctt cctgaccaaa tgctcaacaa tgaattgttt 120 

gatgatctta ctatcagctg ggatatttta cgtaaagcct caaagtcatt ctatactgca 180 

tctgccgttt ttccaagtta tgtacgtcaa gacttgggtg ttctctatgc tttctgcaga 240 

gctaccgatg acctgtgcga tgatgaatcc aaatctgttc aagaaagaag agaccaatta 300 

gatcttactc gacaatttgt tcgtgatctc tttagccaaa agaccagtgc gcctattgtg 360 

attgattggg aattgtatca aaaccaactt cctgcttctt gtatatcagc ctttagagcc 420 

tttactcgcc ttcgccatgt ccttgaagta gaccctgtag aagaactatt agatggttac 480 

aaatgggatc ttgagcgtcg tcctatcctt gatgaacaag acttggaggc atactctgct 540 

tgtgtggcca gtagtgtggg tgaaatgtgc acacgtgtga ttcttgctca agaccaaaag 600 
gaaaatgatg cttggataat tgaccgtgca cgtgagatgg ggctggtgct acaatacgtt ■ 660 

aacattgctc gagacattgt gactgatagc gagactctgg gtcgatgtta tctgcctcaa 720 
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caatggctta gaaaagaaga aacagaacaa atacagcaag gcaacgcccg tagcctaggt 780 

gatcaaagac tgttgggctt gtctctgaag cttgtaggaa aggcagacgc tatcatggtg 840 

agagctaaga agggcattga caagttgccg gcaaactgtc aaggcggtgt acgagctgct 900 

tgccaagtat atgctgcaat tggatctgta ctcaagcagc agaagacaac atatcctaca 960 

agagctcatc taaaaggaag cgaacgtgcc aagattgctc tgttgagtgt atacaacctc 1020 

tatcaatctg aagacaagcc tgtggctctc cgtcaagcta gaaagattaa gagttttttt 1080 

gttgattag 1089 

<210> 69 
<211> 611 
<212> DNA 

<213> Blakeslea trispora 
<400> 69 

agagataaaa taaaaagaga agaaaagaaa gtttgtacaa tttctttttg tttatataac 60 

atacacgcta tgtcaacatt tagaataagg gggaaaaaat cttccatcat attcgaatgc 120 

acaagattat ttctttgttc gctctttttg gtcgggtcat cgagatttag agtgtaatca 180 

aagatactgt catctcgaga gcgttgcaca ggctgctgtt tgccaaattg gatgtttgcc 24 0 

gaattagtaa aatacgcaag catttcttac ctttccgctc ccttttccta attctcccaa 300 

agactaaatg aggaaagata aaggacaaag aaaatgtaaa gacaaagaaa ttgaaaacga 360 

tataaacttg cagcacgtaa gaccaaagca aattggtaac tattcttgtg tacaaacatg 420 

tataaaaaaa aacttttttt tgctcctgga ggacaaaatt tcaaactcct tgaagaagat 480 

tgcttgtata tctatcatat gcatatatca tatcgatgga aaaagaaagt caggcatgta 540 

tttataaaaa gaagaatgtg ccatgcttcc gaatttcttt tcactttctt ttccttatct 600 
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attttaatct c 611 



<210> 70 
<211> 882 
<212> DNA 

<213> Haematococcus pluvialis 
<400> 70 

atgctgtcga agctgcagtc aatcagcgtc aaggcccgcc gcgttgaact agcccgcgac 60 

atcacgcggc ccaaagtctg cctgcatgct cagcggtgct cgttagttcg gctgcgagtg 120 

gcagcaccac agacagagga ggcgctggga accgtgcagg ctgccggcgc gggcgatgag 180 

cacagcgccg atgtagcact ccagcagctt gaccgggcta tcgcagagcg tcgtgcccgg 240 

cgcaaacggg agcagctgtc ataccaggct gccgccattg cagcatcaat tggcgtgtca 300 

ggcattgcca tcttcgccac ctacctgaga tttgccatgc acatgaccgt gggcggcgca 360 

gtgccatggg gtgaagtggc tggcactctc ctcttggtgg ttggtggcgc gctcggcatg 420 

gagatgtatg cccgctatgc acacaaagcc atctggcatg agtcgcctct gggctggctg 480 

ctgcacaaga gccaccacac acctcgcact ggaccctttg aagccaacga cttgtttgca 540 

atcatcaatg gactgcccgc catgctcctg tgtacctttg gcttctggct gcccaacgtc 600 

ctgggggcgg cctgctttgg agcggggctg ggcatcacgc tatacggcat ggcatatatg 660 

tttgtacacg atggcctggt gcacaggcgc tttcccaccg ggcccatcgc tggcctgccc 720 

tacatgaagc gcctgacagt ggcccaccag ctacaccaca gcggcaagta cggtggcgcg 780 

ccctggggta tgttcttggg tccacaggag ctgcagcaca ttccaggtgc ggcggaggag 840 

gtggagcgac tggtcctgga actggactgg tccaagcggt ag 882 



<210> 71 
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<211> 528 
<212> DNA 

<213> Erwinia uredovora 
<400> 71 

atgttgtgga tttggaatgc cctgatcgtt ttcgttaccg tgattggcat ggaagtgatt 60 

gctgcactgg cacacaaata catcatgcac ggctggggtt ggggatggca tctttcacat 120 

catgaaccgc gtaaaggtgc gtttgaagtt aacgatcttt atgccgtggt ttttgctgca 180 

ttatcgatcc tgctgattta tctgggcagt acaggaatgt ggccgctcca gtggattggc 240 

gcaggtatga cggcgtatgg attactctat tttatggtgc acgacgggct ggtgcatcaa 300 

cgttggccat tccgctatat tccacgcaag ggctacctca aacggttgta tatggcgcac 360 

cgtatgcatc acgccgtcag gggcaaagaa ggttgtgttt cttttggctt cctctatgcg 420 

ccgcccctgt caaaacttca ggcgacgctc cgggaaagac atggcgctag agcgggcgct 480 



gccagagatg cgcagggcgg ggaggatgag cccgcatccg ggaagtaa 



528 



<210> 72 
<211> 762 
<212> DNA 

<213> Nostoc sp. PCC73102 
<400> 72 

atgatccagt tagaacaacc actcagtcat caagcaaaac tgactccagt actgagaagt 60 
aaatctcagt ttaaggggct tttcattgct attgtcattg ttagcgcatg ggtcattagc 120 
ctgagtttat tactttccct tgacatctca aagctaaaat tttggatgtt attgcctgtt 180 
atactatggc aaacattttt atatacggga ttatttatta catctcatga tgccatgcat 240 
ggcgtagtat ttccccaaaa caccaagatt aatcatttga ttggaacatt gaccctatcc 300 



ctttatggtc ttttaccata tcaaaaacta ttgaaaaaac attggttaca ccaccacaat 360 
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ccagcaagct caatagaccc ggattttcac aatggtaaac accaaagttt ctttgcttgg 420 

tattttcatt ttatgaaagg ttactggagt tgggggcaaa taattgcgtt gactattatt 480 

tataactttg ctaaatacat actccatatc ccaagtgata atctaactta cttttgggtg 540 

ctaccctcgc ttttaagttc attacaatta ttctattttg gtactttttt accccatagt 600 

gaaccaatag ggggttatgt tcagcctcat tgtgcccaaa caattagccg tcctatttgg 660 

tggtcattta tcacgtgcta tcattttggc taccacgagg aacatcacga atatcctcat 720 

atttcttggt ggcagttacc agaaatttac aaagcaaaat ga 762 

<210> 73 
<211> 617 
<212> DNA 

<213> Haematococcus pluvialis 
<400> 73 

tagggtgcgg aaccaggcac gctggtttca cacctcatgc ctgtgataag gtgtggctag 60 

agcgatgcgt gtgagacggg tatgtcacgg tcgactggtc tgatggccaa tggcatcggc 120 

catgtctggt catcacgggc tggttgcctg ggtgaaggtg atgcacatca tcatgtgcgg 180 

ttggaggggc tggcacagtg tgggctgaac tggagcagtt gtccaggctg gcgttgaatc 240 

agtgagggtt tgtgattggc ggttgtgaag caatgactcc gcccatattc tatttgtggg 300 

agctgagatg atggcatgct tgggatgtgc atggatcatg gtagtgcagc aaactatatt 360 

cacctagggc tgttggtagg atcaggtgag gccttgcaca ttgcatgatg tactcgtcat 420 

ggtgtgttgg tgagaggatg gatgtggatg gatgtgtatt ctcagacgta gaccttgact 480 

ggaggcttga tcgagagagt gggccgtatt ctttgagagg ggaggctcgt gccagaaatg 540 

gtgagtggat gactgtgacg ctgtacattg caggcaggtg agatgcactg tctcgattgt 600 
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aaaatacatt cagatgc 617 

<210> 74 
<211> 1208 
<212> DNA 

<213> Haematococcus pluvialis 
<400> 74 

attgtgactg atagcgagac tctgggtcga tgttatctgc ctcaacaatg gcttagaaaa 60 

gaagaaacag aacaaataca gcaaggcaac gcccgtagcc taggtgatca aagactgttg 120 

ggcttgtctc tgaagcttgt aggaaaggca gacgctatca tggtgagagc taagaagggc 180 

attgacaagt tgccggcaaa ctgtcaaggc ggtgtacgag ctgcttgcca agtatatgct 240 

gcaattggat ctgtactcaa gcagcagaag acaacatatc ctacaagagc tcatctaaaa 300 

ggaagcgaac gtgccaagat tgctctgttg agtgtataca acctctatca atctgaagac 360 

aagcctgtgg ctctccgtca agctagaaag attaagagtt tttttgttga ttagtgaatt 420 

tttgttttat ttatgtctga tagttcaata aagagacaac acatacaata taaaatcatt 480 

gtctttaaat gttaatttag tagagtgtaa agcctgcatt ttttttgtac gcataaacaa 540 

tgaattcacc ccgcttctgg tttttaaata attatgtcaa actagggaaa attctttttt 600 

ttctcttcgt tctttttttg gcttgttgtg gagtcacagg cttgtcttca gattgataga 660 

ggttgtatac actcaacaga gcaatcttgg cacgttcgct tccttttaga tgagctcttg 720 

taggatatgt tgtcttctgc tgcttgagta cagatccaat tgcagcatat acttggcaag 780 

cagctcgtac accgccttga cagtttgccg gcaacttgtc aatgcccttc ttagctctca 840 

ccatgatagc gtctgccttt cctacaagct tcagagacaa gcccaacagt ctttgatcac 900 

ctaggctacg ggcgttgcct tgctgtattt gttctgtttc ttcttttcta agccattgtt 960 
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gaggcagata acatcgaccc aacatcctcg agccatacta cagcataaaa ggatacgttt 1020 

tctttaacag aaatttaccc ttttgttatc agcacataca aaaaaaaaga aatttaagat 1080 

gagtaggact tccattctct caaaaatttt attcaatcca taaatgaatt atttttggac 1140 

aaaaaagaaa gattatgcct gattttctct attttttttt tttttacaac tccaccaata 1200 

ctttctag 1208 



<210> 75 

<211> 6316 

<212> DNA 

<213> Blakeslea trispora 



<220> 

<221> misc_f eature 

<222> (2694) . . (2694) 

<223> n is a, c, g, or t 



<220> 

<221> misc_f eature " 

<222> (4263) . . (4263) 

<223> n is a, c, g, or t 



<400> 75 

aaggatgaag aatccaactc taataaaaat cttatggata tctttgatcg actcaaaaag 60 

gctttcaatg ctattgctat taaaaaaaaa gagagagaga gaactatgag caaaaggact 120 

ctatgccaag atggcaaaaa ggcaccagaa acccttagtt tattattgca taatccagtc 180 

gagctagtac ttctgtagct caagcttaac cgaggatctt ggaatcaact cgtctcgtca 240 

ctcttgccga tgatcctaga aatggtatct atggatgtta tactaacatt gttatctttc 300 



aaggcctcga agatgttatt gttgcggtga taaataggct gctatgtact gaagttgctc 360 
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tgtaaaatga atctagttca ctgcctactc agcaaatggt tgtttctaat gtctttaaag 420 

aaagaaaaaa agatacatat agactaccct tcctttcaag actgtaatcg agaatcggcc 480 

gatggtttat tacaattaga cgctgggaat aagcaaaagg attcatcttt gtaaataaga 540 

gactggtgca tatgaaagca aggatcgtat caaggaatag ttttgatcga gcatcaccag 600 

caaatgctgc taatgttggc ttcttctttg cttcctgaga ttgaatggga tgtgcctaga 660 

gcattgctat ttttaagtgt atactttaga tttgtgtctt tagatttgtg tcattttatt 720 

tagtcaagaa agatccccct ttctctatgt atgctaagaa gaaggagcaa gaagtgtatt 780 

tacaagttgg aatgagattg aaatattgta cataataata ataaaaagaa aggtagatca 84 0 

aaaaaaatgt tctgcctatt gtaagaaatc gggaccaaca ggtgcttgat aaccagaagt 900 

agcttccaat tcaggtagag gctctaggga caaatacaca attatgacag gaattttctt 960 

gttgacttga acactacaag agaaacgggt cagcacaaaa tccgaaaaaa aaaagaaacg 1020 

gaccattcat gtcttaccta tctagctctt tgtcttcaat tgcatcccat tgctcaacca 1080 

cagatacgct tcccaattga gtatattgat gaagtgttcc ctgcattttt cgcttgacta 1140 

attccactac agtcacagtc ttattaatgt tttgtccttt accagtcagg ataatatgat 1200 

ctttttgctt cttctatcaa aaaaataatt cttgttttga ataaaaaaaa caaatattta 1260 

aagaaactac tttgatgacg gtacctggaa taactcgaga cacacatcta catatgcgtt 1320 

gattttattg tggctaattc gaacctcatt ttctgctggt gggggctgtt gactttcagt 1380 

tgctgagacg tccttcttgc ttcttttata gtcttccact atgattttaa tcaagaaagt 1440 

aagtcagtga tgattgttac aagctatata tcttgaaaaa gaacagagag gtattattat 1500 

cagatgcaac atggttttct gtatcatttt catttcagtt tctctgttca aaaaaaaaaa 1560 

gaacactttc tctttccact cctcaaattt tttctgctaa actcctcgca aaacatgtat 1620 
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ttgctttaaa ctacaagttg caattgtctg atttagcaat ttcaatatgc cttttgtgaa 1680 

tccacccaaa aataaacaag tgcttgagta tacttgggtt cagttcaaaa gaaagcaagc 1740 

tttttttttt ctttcttggg aaagaaaaaa aaatattgtt gagccatcct ttaccagcag 1800 

tatgcgagct acgacatagc tggtctaaca atgactgcaa gcaatagatc gagcttagtc 1860 

tttctattgc ttcyttgttt gatctatgtt cggccttacg ctgacctatc caatactcga 1920 

gataggcaac aagatttcga acagtaatga aataaatttc ggataacagt tgtggatgag 1980 

gaagagaaag cgacttgaac tcgagaaact ttgttgaaat gaaatccgac cttttacgtg 2040 

atcatcatgt attatcctct ttttcttttt tttcgtagtg aattacttac tgattgcgct 2100 

caagtcgcgt ctttataaag aagaaaaaaa aatattagaa ctttcaaaaa atataactga 2160 

aaataaaagt gtggctcgga gagcaaatac cacatccttt gtcttcgctt tggtaacacg 2220 

gttaataagc cactataggt gaataatgat catttctgag aataaagcgc ggcttgaagc 2280 

ttatatccat atcaggattc atattaggca caactcacaa ttgaggttcc agaagtgcca 2340 

attttttttt cctgatagcc tgtccaatta agatcaaaaa ccactgagtt ttctctatat 2400 

attttttttt ttcataattc ttaactcttc ttcctctctc tctctctctc tctctttttg 2460 

gcttgcaaaa aaaatcttta gtaataccaa agaaagcaaa ccttttcctt ttcttatttc 2520 

cttgcttgtt ttttaatttt tgatttctct atgctttaaa tacccatttc tttctttctt 2580 

ctgctattac ctatcttttc attcctctcc cccctctctc tcttggtcta taaacatcat 2640 

gaagtcctct tttaaaagtt cgcttgacat ttatgctgtt tatatacagc atcntgtgtt 2700 

ttccaagtgg ttcattcttg cttttgttct ttcgattttc ctcaacactt atctactgaa 2760 

cgcttcgaag caacagccca aagtgataat caaaaaggtt attgagcggg tagaagtacc 2820 
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aagtagagaa caacctaaat cagtcataaa gccctcctcc aagaaacact cttctcatca 
tcagtctgat gtcattcgcc ctcttgatga agtattgggt ttgctcggaa cacccgaggc 
cttgactgat gaagagatca tctctattgt tcaagctggt aaaatggccc cctatgctct 
tgaaaaggtc ttgggcgatt tagagcgcgc tgtccatatc cgtcgtgctt tgatctcccg 
tgactctcgt acgaaaactt tggaagacag tatgcttccc gtgaaaaact atcattatga 
taaagtcatg ggtgcttgtt gtgaaaatgt cattggttat atgcctattc cagtaggtgt 
cgcaggtaag aagttcaaca agtcgcgata tttgacaagt tgctcatcat tttcgaaaca 
ggtcctttgg tgattgatgg tgattctatt catattccca tggcaactac ggaaggttgt 
ttagttgctt ctactgccag aggttgtaaa gcaatcaatg ctggtggtgg tgccaacaca 
attgttgttg ctgatggtat gactcgaggt ccttgtgtcg aatttcctac aatcactcgc 
gctgctgact gtaaacgatg gattgaacaa gagggtgaag ctatcgtgac cgaggcattc 
aattcaactt ctcgttttgc tcgtgttcgt aaattgaaag ttgctcttgc cggtcgtcta 
gtctacatcc gtttctctac cactacaggt gatgcaatgg gcatgaacat gatctccaag 
ggttgtgaaa aggctttaag caagattgct gagagatatc ctgatatgca gatcatttct 
ctttctggta actattgtac tgacaagaaa cctgctgcta tcaactggat tgaaggacgt 
ggtaaatctg ttgttgctga sgctgtcatc cctggtacgg ttgtcgaaaa ggtattgaag 
acctctgtta gtgctttggt tgagctgaac atctctaaaa acctggttgg ttctgctatg 
gctggctccg tcggtggctt taacgctcat gctgctaata ttctaactgc catttacctt 
gctactggtc aagatcctgc tcaaaatgta sagagttcta actgtattac tttgatgaaa 
gctgtcaatg gcgaaagaga ccttcatatc tcttgtacaa tgccctgtat tgaagtaggc 
accattggtg gtggtactat tttgcctcct caacaagcca tgttggattt cattggtgtg 



2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
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cgtggtcctc accctaccga acctggtgcc aatgcccgwc gccttgctcg tgttatctgt 4140 

gcctctgtga tggctggtga attgtcttta tgtgcagctt tggctgctgg tcatcttgta 4200 

aaggcacaca tggctcataa tcgtaatacc actgctgctg ccgctgttgt tcctgcccct 4260 

aanggcatag ttgatgtctc tacacctcct gctacacctg cagaaaagaa tgatcctatt 4320 

cctggaagtt gtatcaagtc atagaattaa tattatatat atatcatata caaaaaaaag 4380 

aaaaaaaaaa cactacatct atttatattt ctccatgtac acacacacac acacatataa 4440 

aaactcttta ttttccaata ttttgctttt ataaataatc ttatttcatt ctaaataaac 4500 

tgtttttttt tattaatcat caaaccctgc tgagagctgt gcaatatcat ctatgttttc 4560 

atggtttaac tctggtatcg gwcgagcctc ctctgtactt gaagtttgta ggcagttttt 4 620 

atttaaggct gctggtcgat catgatcatc akcaaacctg acagcatgaa gttttgactg 4680 

atgagcaatt tcactaaggg cagaatctga actctttcgc ttcctactat tgaccatatt 4740 

gtctttaggt ggaatgagtg aatagcgtct tgtcatatgt aacacagaat caacaatatc 4800 

ctggtgatga aactcggcca aacatagcgc ctttctcccc caacaattat aataatcaaa 4860 

atgagaatga catgtacggt tttcctcgat gacaatatcc aacgtcttgt cataatcctc 4 920 

tgtgcgyata ccattcatct tttggaagaa cgcacggtag ctctcacaag ctgtcctcag 4 980 

agagttccgt gccatgtttc ccaatgctcc tggcaagtcg aaatgaagtt gtcgaatctg 5040 

gcgatgtatg tctacaatgt cgcctgtttc tttcattaga tcaagcattc gtgtagccca 5100 

aatgatgtct atgttatgat tttctttcat tccagtaata actatagttt ctcggcaaat 5160 

cgaatgastg atggagtaaa ttcatcaaaa gtgcaagtaa tacatacagt gcttgaagaa 5220 

atcttgtgta gcacgcctat attatgtaat ataggatcga ttctcgaaac tcgacataac 5280 
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caccaggctt tagcaagcgt tttatttcat tcatgacaag ctattgttaa ttcytgctta 5340 

ataaaacaaa atgaaaaaaa catacccccc tcmaaactta cttcccactc ttgattggaa 5400 

aaacaggtat agacgtgacg catatgtata taatcaaaac actcatcagg atagggtaaa 54 60 

ccattgagca catcgcattg ggtgaagaaa gtattaggag gcttgatggc tgtaggatat 5520 

ataggtgcaa tatcaatacc gtaaaactca gcatttggga attctgtagc catctccaga 5580 

atccaagtac ctgtgccaca agcaacatca agcactttag gtaagggtat acattgttgt 5640 

tcttgttgtt gttgttgaca atcacttgag tctgagtttc gttttgattg ttttaatgac 5700 

aataattctt ttacaggtgc tgagaaatta ccgtcaaata gatacttgta aataaaatgc 5760 

taaaaataaa aacaatagaa aaaaaaattg acgctcattt cattactatg gaaataactg 5820 

caaaatctta ccacttgtac aagtctatct tgctcaatct catcgtttgg cagaatgtat 5880 

ttattgttgt agtattgata tcttctacca ttcatgatat aactgtcgct tctaatgctc 5940 

tgaggtgaag tacttgtagg tgaaggtgga agtgacgcaa ttttgtcaag cttaacagga 6000 

tcctctcggc tacatgtttrt ctgcatatca ggaaaatctt gtttatttga aacatcaaca 6060 

gtagatgtgg tgtgatcttt tttgaaaata tcgatgcctt cctttgaaag ccttttgaaa 6120 

ggctctttta acttttttga gtgagagcta cccatgatag cttatgaaga attaaaaaga 6180 

aaaaagcaaa aaaaattaaa aaaaaaaaaa gtagcaaaaa attctgtcgt aattatacaa 6240 

gccaatcaaa atcgaaattc atgcaaggca tagatgttca cgtggatttg atggttgatc 6300 

cttttttttt gcaaga 6316 



<210> 76 

<211> 1170 

<212> DNA 

<213> Thermus thermophilus 
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<400> 76 

atgaagcgcc tttccctgag ggaggcctgg ccctacctga aagacctcca gcaagatccc 60 

ctcgccgtcc tgctggcgtg gggccgggcc cacccccggc tcttccttcc cctgccccgc 120 

ttccccctgg ccctgatctt tgaccccgag ggggtggagg gggcgctcct cgccgagggg 180 

accaccaagg ccaccttcca gtaccgggcc ctctcccgcc tcacggggag gggcctcctc 240 

accgactggg gggaaagctg gaaggaggcg cgcaaggccc tcaaagaccc cttcctgccg 300 

aagaacgtcc gcggctaccg ggaggccatg gaggaggagg cccgggcctt cttcggggag 360 

tggcgggggg aggagcggga cctggaccac gagatgctcg ccctctccct gcgcctcctc 420 

gggcgggccc tcttcgggaa gcccctctcc ccaagcctcg cggagcacgc ccttaaggcc 480 

ctggaccgga tcatggccca gaccaggagc cccctggccc tcctggacct ggccgccgaa 540 

gcccgcttcc ggaaggaccg gggggccctc taccgcgagg cggaagccct catcgtccac 600 

ccgcccctct cccaccttcc ccgagagcgc gccctgagcg aggccgtgac cctcctggtg 660 

gcgggccacg agacggtggc gagcgccctc acctggtcct ttctcctcct ctcccaccgc 720 

ccggactggc agaagcgggt ggccgagagc gaggaggcgg ccctcgccgc cttccaggag 780 

gccctgaggc tctacccccc cgcctggatc ctcacccgga ggctggaaag gcccctcctc 840 

ctgggagagg accggctccc cccgggcacc accctggtcc tctcccccta cgtgacccag 900 

aggctccact tccccgatgg ggaggccttc cggcccgagc gcttcctgga ggaaaggggg 960 

accccttcgg ggcgctactt cccctttggc ctggggcaga ggctctgcct ggggcgggac 1020 

ttcgccctcc tcgagggccc catcgtcctc agggccttct tccgccgctt ccgcctagac 1080 

cccctcccct tcccccgggt cctcgcccag gtcaccctga ggcccgaagg cgggcttccc 1140 

gcgcggccta gggaggaggt gcgggcgtga 1170 



BASF AG 

BASF NAE 877/03 



361/365 



January 08, 2004 



<210> 77 
<211> 2981 
<212> DNA 

<213> Blakeslea trispora 
<400> 77 

tctagaattc attccattcg aaaggatcaa cataaccaat ttaatgacta ctagctaatg 60 

gatacaaata tacgcacaaa aaaagaaaga attctatgat caaagagaac acagacacag 120 

agtgatacat ttaaatggtt aagttcttat gatgttaaaa tggtaacttt attattgaat 180 

taaatgcgaa tatcgttgct gctttgtact tggaaaacgt taggtaaaag ttggttaatg 240 

aaagaagcag gagttgtagt atcatctctt gggaagaaat agaaaaagag gaaagtaaca 300 

aagtaacaag caagacaata atagatccaa tggctttcgg tcttacgagt ttgttcagga 360 

gcatacttct tttggctatc ttgtaacttt cttggtaagg gattctggcc aaagctttta 420 

cagacttggt cggaagtaag cttacttcca gcaagaacga taggaacacc agtacctgga 480 

tgtgtactac aaagaaaaga gaaatgagta cgtgcgttat taaaaaaaag aaaaaaagag 540 

ggcaaaagta ttacctagct ccgacaaaga aaagattatc ataacggttt gtggaatcct 600 

tggtactagg tctgaaccag agaacttgga acacatcatg agaaagacca agaatagaac 660 

ctctccaaag gttaaacttg ctttgccaaa cactaggatc attcacttct tcatgttcaa 720 

tcaaattagc aaagttgttt actcccaaac gacgttcgat aacttccaga accatcttgc 780 

gtgcacggtt taccaactca ggataatttt cttcagcact gtttcctgtc ttactcttca 840 

tatggccaat tggaaccaac acaataatgg agtccttgtt gggaggtgcg gcagattcat 900 

caattcgaga tggaacgttg acatagaatg aagcttcaga gggcaaaccg aagtcgttga 960 

aaatctcatc aaaactttcc ttgtaggctt cagccaagaa gatattgtgt acgtctaatt 1020 
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gaggcacctt tgttgacatg gaccaataaa acgaaataga tgatgaagtg agtttctttg 1080 

aggctaatgt cttctttgtc caattgcaag gaggtaacag atggtgataa gcataaacaa 1140 

gatccgcatt acatacgact gcatcggctt caatgacttc tccgctttcc aaagtgacac 1200 

cggttacacg cttgtcttta tcgacagtgt taattttagc aacaggcgat tgatatctga 1260 

attcagcacc gtactttttg gaggcgatag actcaagctt ctgaacaacc atgttgaaac 1320 

caccacgagg ataccagata ccttcagcaa actcggtgta ttgtaacaaa ctgtaaactg 1380 

ctggagcatc ataaggcgac atactatatt ccaaaaatag aaaatagaac aatgaatatc 1440 

aaaattcctt tcacttgccc tttttcacat ttctcttttc ccacccccga ccggtctcac 1500 

tcattttttt ttcatcccac accacgcgtt gtatgtgtac ttaccccata tacattgttt 1560 

gaaaagtaaa agccatacgc attttcttgg tttggaaata tttactggct cggtcataga 1620 

tcttaccaaa caagtgcaag cgaaagattt caggcacata ctgaagacga atcaaatccc 1680 

aaatggtttc aaagttgcgc ttgatagcaa taaatgtacc ttgttcataa tggacatgtg 1740 

tttccttcat gaaatccaag aatctaccaa atccaagggg accctcaata cggtccaatt 1800 

cgcccttcat cttggttaaa tcggaagaga gttgtacggc atcaccgtcg tcaaaatgaa 1860 

ccttatagtt attgtcacag cgaagcaaat ccaaatgatc accaatacgt tcatccaaat 1920 

cagcaaatgc atcttcaaaa agcttaggca tcaaatagag tgagggaccc tgatcaaagc 1980 

gatgaccatc gtgatgaatg aatgaacaac ggccaccgga aaagtcgttc ttttcaacaa 2040 

cagtaactcg aaaaccttca cgagcaagac gagcagcagt agcagttccg ccaataccgg 2100 

caccaatgac aacaatatgc ttcttttgat cagacatgag attaaaatag ataaggaaaa 2160 

gaaagtgaaa agaaattcgg aagcatggca cattcttctt tttataaata catgcctgac 2220 
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tttctttttc catcgatatg atatatgcat atgatagata tacaagcaat cttcttcaag 2280 

gagtttgaaa ttttgtcctc caggagcaaa aaaaagtttt tttttataca tgtttgtaca 2340 

caagaatagt taccaatttg ctttggtctt acgtgctgca agtttatatc gttttcaatt 2400 

tctttgtctt tacattttct ttgtccttta tctttcctca tttagtcttt gggagaatta 2460 

ggaaaaggga gcggaaaggt aagaaatgct tgcgtatttt actaattcgg caaacatcca 2520 

atttggcaaa cagcagcctg tgcaacgctc tcgagatgac agtatctttg attacactct 2580 

aaatctcgat gacccgacca aaaagagcga acaaagaaat aatcttgtgc attcgaatat 2640 

gatggaagat tttttccccc ttattctaaa tgttgacata gcgtgtatgt tatataaaca 2700 

aaaagaaatt gtacaaactt tcttttcttc tctttttatt ttatctctat gtcaatactc 2760 

acttatctgg aatttcatct ctactataca ctacctgtcc ttgcggcatt gtgttggctg 2820 

ctaaagccgt ttcactcaca gcaagacaat ctcaagtata aatttttaat gttgatggcc 2880 

gcctctaccg catcgatttg ggacaattat atcgtttatc atcgcgcttg gtggtactgt 2940 

cctacttgtg ttgtggctg~t cattggctat gtacctctag a 2981 

<210> 78 
<211> 1749 
<212> DNA 

<213> Blakeslea trispora 
<400> 78 

atgtctgatc aaaagaagca tattgttgtc attggtgccg gtattggcgg aactgctact 60 

gctgctcgtc ttgctcgtga aggttttcga gttactgttg ttgaaaagaa cgacttttcc 120 

ggtggccgtt gttcattcat tcatcacgat ggtcatcgct ttgatcaggg tccctcactc 180 

tatttgatgc ctaagctttt tgaagatgca tttgctgatt tggatgaacg tattggtgat 240 
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catttggatt tgcttcgctg tgacaataac tataaggttc attttgacga cggtgatgcc 300 

gtacaactct cttccgattt aaccaagatg aagggcgaat tggaccgtat tgagggtccc 360 

cttggatttg gtagattctt ggatttcatg aaggaaacac atgtccatta tgaacaaggt 420 

acatttattg ctatcaagcg caactttgaa accatttggg atttgattcg tcttcagtat 480 

gtgcctgaaa tctttcgctt gcacttgttt ggtaagatct atgaccgagc cagtaaatat 540 

ttccaaacca agaaaatgcg tatggctttt acttttcaaa caatgtatat gggtatgtcg 600 

ccttatgatg ctccagcagt ttacagtttg ttacaataca ccgagtttgc tgaaggtatc 660 

tggtatcctc gtggtggttt caacatggtt gttcagaagc ttgagtctat cgcctccaaa 720 

aagtacggtg ctgaattcag atatcaatcg cctgttgcta aaattaacac tgtcgataaa 780 

gacaagcgtg taaccggtgt cactttggaa agcggagaag tcattgaagc cgatgcagtc 840 

gtatgtaatg cggatcttgt ttatgcttat caccatctgt tacctccttg caattggaca 900 

aagaagacat tagcctcaaa gaaactcact tcatcatcta tttcgtttta ttggtccatg 960 

tcaacaaagg tgcctcaatt agacgtacac aatatcttct tggctgaagc ctacaaggaa 1020 

agttttgatg agattttcaa cgacttcggt ttgccctctg aagcttcatt ctatgtcaac 1080 

gttccatctc gaattgatga atctgccgca cctcccaaca aggactccat tattgtgttg 1140 

gttccaattg gccatatgaa gagtaagaca ggaaacagtg ctgaagaaaa ttatcctgag 1200 

ttggtaaacc gtgcacgcaa gatggttctg gaagttatcg aacgtcgttt gggagtaaac 1260 

aactttgcta atttgattga acatgaagaa gtgaatgatc ctagtgtttg gcaaagcaag 1320 

tttaaccttt ggagaggttc tattcttggt ctttctcatg atgtgttcca agttctctgg 1380 

ttcagaccta gtaccaagga ttccacaaac cgttatgata atcttttctt tgtcggagct 1440 

agtacacatc caggtactgg tgttcctatc gttcttgctg gaagtaagct: tacttccgac 1500 
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caagtctgta aaagctttgg ccagaatccc ttaccaagaa agttacaaga tagccaaaag 1560 

aagtatgctc ctgaacaaac tcgtaagacc gaaagccatt ggatctatta . ttgtcttgct 1620 

tgttactttg ttactttcct ctttttctat ttcttcccaa gagatgatac tacaactcct 1680 

gcttctttca ttaaccaact tttacctaac gttttccaag tacaaagcag caacgatatt 1740 



<210> 79 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 

<400> 79 

ccgatggcga cgacggaagg ttgtt 25 

<210> 80 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer 



cgcatttaa 



1749 



<400> 



80 



catgttcatg cccattgcat cacct 



25 



